HIV ENVELOPE POLYPEPTIDES AND VACCINE

BACKGROUND OF THE INVENTION

Field of the Invention

This invention relates to HIV envelope 
polypeptides and vaccines containing the polypeptides. 

Description of the Related Art
Acquired immunodeficiency syndrome (AIDS) is
caused by a retrovirus identified as the human
immunodeficiency virus (HIV). There have been intense
efforts to develop a vaccine that induces a protective
immune response based on induction of antibodies or
cellular responses. Recent efforts have used subunit
vaccines where an HIV protein, rather than attenuated
or killed virus, is used as the immunogen in the
vaccine for safety reasons. Subunit vaccines generally
include gp120, the portion of the HIV envelope protein
which is on the surface of the virus.

The HIV envelope protein has been extensively
described, and the amino acid and nucleic acid
sequences encoding HIV envelope from a number of HIV
strains are known (Myers, G. et al., 1992. Human
Retroviruses and AIDS. A compilation and analysis of
nucleic acid and amino acid sequences. Los Alamos
National Laboratory, Los Alamos, New Mexico). The HIV
envelope protein is a glycoprotein of about 160 kD
(gp160) which is anchored in the membrane bilayer at
its carboxyl terminal region. The N-terminal segment,
gp120, protrudes into the aqueous environment
surrounding the virion and the C-terminal segment,
gp41, spans the membrane. Via a host-cell mediated
process, gp160 is cleaved to form gp120 and the
integral membrane protein gp41. As there is no
covalent attachment between gp120 and gp41, free gp120
is sometimes released from the surface of virions and
infected cells.

The gp120 molecule consists of a polypeptide core
of 60,000 daltons which is extensively modified by
N-linked glycosylation to increase the apparent
molecular weight of the molecule to 120,000 daltons.
The amino acid sequence of gp120 contains five
relatively conserved domains interspersed with five
hypervariable domains. The positions of the 18
cysteine residues in the gp120 primary sequence, and
the positions of 13 of the approximately 24 N-linked
glycosylation sites in the gp120 sequence are common to
all gp120 sequences. The hypervariable domains contain
extensive amino acid substitutions, insertions and
deletions. Sequence variations in these domains result
in up to 30% overall sequence variability between gp120
molecules from the various viral isolates. Despite
this variation, all gp120 sequences preserve the
ability of the virus to bind to the viral receptor CD4
and to interact with gp41 to induce fusion of the viral
and host cell membranes.
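
As a rough illustration of how potential N-linked glycosylation sites can be located in a gp120 sequence, the sketch below scans a one-letter amino acid string for the commonly used sequon pattern Asn-X-Ser/Thr, where X is any residue except proline. The sequon rule, the function name `find_sequons`, and the one-letter rendering of the sample fragment are assumptions made for this example and are not taken from the specification; the fragment corresponds to residues 130-142 of clone C6.1 as listed in Table 1 (Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr).

```python
import re

# Assumed sequon rule (not stated in the specification):
# Asn, then any residue except Pro, then Ser or Thr.
# A lookahead is used so that overlapping sequons are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(protein):
    """Return 0-based indices of Asn residues that start an N-X-S/T sequon."""
    return [match.start() for match in SEQUON.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical one-letter rendering of residues 130-142 of clone C6.1.
    fragment = "KEGEIKNCSFNIT"
    for index in find_sequons(fragment):
        # Report each hit together with its three-residue sequon.
        print(index, fragment[index:index + 3])
```

A match only marks a potential site; whether a given sequon is actually glycosylated has to be determined experimentally.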

gp120 has been the object of intensive
investigation as a vaccine candidate for subunit
vaccines, as the viral protein which is most likely to
be accessible to immune attack. At present, clinical
trials using gp120 MN strain are underway. However, to
date no human vaccine trial has been of sufficient size
to confirm or refute vaccine efficacy.

The development of candidate HIV-1 vaccines is
burdened by the lack of in vivo or in vitro models of
HIV-1 infection that accurately approximate the
conditions of natural infection in humans. Several
candidate HIV-1 vaccines [Berman et al.; J. Virol.
7:4464-9 (1992); Haigwood et al.; J. Virol. 66:172-82
(1992); Salmon-Ceron et al.; AIDS Res. and Human
Retroviruses 11:1479-86 (1995)] have been described
that elicit broadly cross-reactive antibodies able to
neutralize a variety of diverse HIV-1 isolates in
vitro. However, the relevance of in vitro assays to
protective immunity in vivo is uncertain. Although
several vaccines have provided chimpanzees with
protection from challenge by homologous and
heterologous strains of HIV-1, protection has not
always correlated with in vitro neutralization assays
carried out in T cell lines, or in lectin- and
cytokine-activated peripheral blood mononuclear cells
(PBMCs) [Berman et al.; Nature 345:622-5 (1990); Bruck
et al.; Vaccine 12(12):1141-8 (1994); El-Amad et al.;
AIDS 9:1313-22 (1995); Girard et al.; J. Virol.
69:6239-48 (1995); and Fultz et al.; Science
256:1687-1690 (1992)]. While successful protection of
chimpanzees is encouraging and has historically proved
to be a reliable indicator of vaccine efficacy, the
conditions of infection in all experimental models of
HIV-1 infection differ significantly from natural
infection in humans.

Experimental HIV-1 infection in vivo and in vitro
both suffer from the limitation that the in vitro
amplification of HIV-1, which is required to prepare
virus stocks for in vitro or in vivo infectivity
experiments, imposes a genetic selection that results
in a spectrum of virus quasi-species that differ from
the spectrum of variants present in the clinical
specimens used to establish the culture [Kusumi et al.;
J. Virol. 66:875 (1992); Meyerhans et al.; Cell
58:901-10 (1989)]. Because of these uncertainties, and
even greater uncertainties related to the amount of
virus transmitted, the site and cell type involved in
initial replication, and the kinetics of virus
dissemination, the ability of currently available in
vitro or in vivo assays to reliably predict vaccine
efficacy is questionable.

One of the candidate HIV-1 vaccines that have
entered human clinical trials is recombinant gp120
prepared in Chinese hamster ovary (CHO) cells from the
MN strain of HIV-1 (MN-rgp120) (Berman et al.;
J. Virol. 7:4464-9 (1992)). To date, approximately 499
adults have participated in Phase 1 and 2
immunogenicity and safety trials of this vaccine. The
data collected thus far suggest that MN-rgp120 is safe,
immunogenic, and elicits high titers of neutralizing
antibodies in greater than 95% of individuals immunized
according to a 0, 1, and 6 month immunization schedule
[Belshe et al.; JAMA 272(6):475-80 (1994); McElrath;
Seminars in Cancer Biol. 6:1-11 (1995)]. However,
during the course of these trials, nine vaccinees who
received MN-rgp120 have become infected with HIV-1
through high risk behavior. Small trials, such as
these, in populations with low rates of infection and
minimally sized placebo control groups do not have
sufficient statistical power to confirm or refute
vaccine efficacy.
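
The point about statistical power can be made concrete with a small calculation. The sketch below uses a textbook normal approximation for comparing infection proportions in two equally sized arms. The function name `two_arm_power`, the 2% placebo infection rate, the 50% vaccine efficacy, and the arm sizes are all hypothetical values chosen for illustration; none of them come from the trials described above.

```python
from statistics import NormalDist


def two_arm_power(p_placebo, efficacy, n_per_arm, alpha=0.05):
    """Approximate power of a two-sided two-proportion z-test.

    p_placebo : assumed infection probability in the placebo arm
    efficacy  : assumed fractional reduction in infection risk (0 to 1)
    n_per_arm : number of participants in each arm (equal arms assumed)
    """
    p_vaccine = p_placebo * (1.0 - efficacy)
    p_pooled = (p_placebo + p_vaccine) / 2.0
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    # Standard error under the null (pooled) and under the alternative.
    se_null = (2.0 * p_pooled * (1.0 - p_pooled) / n_per_arm) ** 0.5
    se_alt = (
        p_placebo * (1.0 - p_placebo) / n_per_arm
        + p_vaccine * (1.0 - p_vaccine) / n_per_arm
    ) ** 0.5
    z = (abs(p_placebo - p_vaccine) - z_alpha * se_null) / se_alt
    return NormalDist().cdf(z)


if __name__ == "__main__":
    # Hypothetical scenario: 2% infection rate, 50% efficacy.
    for n in (250, 5000):
        print(n, round(two_arm_power(0.02, 0.5, n), 3))
```

Under these assumed inputs the power for arms of a few hundred participants falls well below one half, while arms of several thousand approach certainty, which is consistent with the statement that small trials in low-incidence populations cannot confirm or refute efficacy.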

However, effective vaccines based on gp120 or
another HIV protein for protection against additional
strains of HIV are still being sought to prevent the
spread of this disease.

Description of the Background Art
Recombinant subunit vaccines are described in
Berman et al., PCT/US91/02250 (published as number
WO91/15238 on 17 October 1991). See also, e.g.,
Hu et al., Nature 328:721-724 (1987) (vaccinia virus-
HIV envelope recombinant vaccine); Arthur et al.,
J. Virol. 63(12):5046-5053 (1989) (purified gp120);
and Berman et al., Proc. Natl. Acad. Sci. USA
85:5200-5204 (1988) (recombinant envelope glycoprotein
gp120).

Numerous sequences for gp120 are known. The
sequence of gp120 from the IIIB substrain of HIV-1LAI
referred to herein is that determined by Muesing et
al., "Nucleic acid structure and expression of the
human AIDS/lymphadenopathy retrovirus," Nature
313:450-458 (1985). The sequences of gp120 from the
NY-5, Jrcsf, Z6, Z321, and HXB2 strains of HIV-1 are
listed by Myers et al., "Human Retroviruses and AIDS; A
compilation and analysis of nucleic acid and amino acid
sequences," Los Alamos National Laboratory, Los Alamos,
New Mexico (1992). The sequence of the Thai isolate
A244 is provided by McCutchan et al., "Genetic Variants
of HIV-1 in Thailand," AIDS Res. and Human Retroviruses
8:1887-1895 (1992). The MN1984 clone is described by
Gurgo et al., "Envelope sequences of two new United
States HIV-1 isolates," Virol. 164:531-536 (1988). As
used herein, MN, MN-rgp120, the MN clone or isolate
refers to MNGNE. The MNGNE amino acid sequence is
Sequence ID No. 29.

Each of the above-described references is 
incorporated herein by reference in its entirety. 


Summary of the Invention
Oligonucleotide sequences encoding gp120
polypeptides from breakthrough isolates of vaccine
trials using MN-rgp120 and the encoded gp120
polypeptides are provided. Use of the gp120
polypeptides from one or more of the isolates in a
subunit vaccine, usually together with MN-rgp120, can
provide protection against HIV strains that are
sufficiently different from the vaccine strain (e.g.,
MN-rgp120) that the vaccine does not confer protection
against those strains. Antibodies induced by the
polypeptides are also provided.

Brief Description of the Drawings
Figure 1 illustrates the kinetics of antibody
response to MN-rgp120 in vaccinees infected with HIV-1.
Sera were collected at the time points indicated and
assayed for antibodies reactive with MN-rgp120 (open
circles) or a synthetic peptide derived from the V3
domain of MN-rgp120 (closed circles). Arrows indicate
dates of injection. Plus sign indicates the first time
HIV-1 infection was detected. Shaded area indicates
data collected after HIV-1 infection. Data from
vaccinee C6 is shown in panel A; C8 in panel B; C7,
panel C; C11, panel D; C10, panel E; C17, panel F; and
C15, panel G.

Figure 2 illustrates the kinetics of CD4 blocking
antibody response in vaccinees infected with HIV-1.
Sera were collected at the time points indicated and
assayed for antibodies able to block the binding of
[125I]-labeled MN-rgp120 to cell surface CD4. Arrows
indicate dates of injection. Plus sign indicates the
first time HIV-1 infection was detected. Shaded area
indicates data collected after HIV-1 infection. Data
from vaccinee C6 is shown in panel A; C8 in panel B;
C7, panel C; C11, panel D; C10, panel E; C17, panel F;
and C15, panel G.

Figure 3 illustrates predicted amino acid
sequences of envelope glycoproteins (gp120) from
breakthrough viruses. Proviral DNA sequences were
amplified by PCR from PBMCs and cloned into the pRK5
expression plasmid. Two clones from each infected
vaccinee were sequenced from double stranded plasmid
DNA. Sequence numbering is with reference to the
initiator methionine residue of gp120. For the purpose
of comparison, the sequences shown begin at amino
acid 12 of the mature, fully processed, envelope
glycoproteins (corresponding to position 41 of the
gp120 open reading frame). Shaded areas indicate
sequences at neutralizing epitopes; dark boxes indicate
polymorphisms thought to be important for the binding
of virus neutralizing MAbs reactive with MN-rgp120.
Conserved (C) regions and variable (V) regions are
indicated above the sequences. Boxes indicate sequence
homologies and polymorphisms.

Figure 4 illustrates immunoprecipitation of
recombinant gp120 prepared from breakthrough viruses.
Recombinant gp120s from the seven breakthrough viruses
were prepared by transient transfection of 293s cells.
Cells were metabolically labeled with [35S]-methionine and
growth conditioned cell culture supernatants were
immunoprecipitated with polyclonal antisera to
MN-rgp120. Immunoprecipitates were resolved by
SDS-PAGE and visualized by autoradiography. C8 lanes a
and b correspond to clones C8.3 and C8.6; C6 lanes a
and b correspond to clones C6.1 and C6.5; C7 lanes a
and b correspond to clones C7.2 and C7.10; C17 lanes a
and b correspond to clones C17.1 and C17.3; C11 lanes a
and b correspond to clones C11.5 and C11.7; C10 lanes a
and b correspond to clones C10.5 and C10.7; C15 lanes a
and b correspond to clones C15.2 and C15.3.

Figure 5 illustrates binding of monoclonal
antibodies to recombinant gp120 from breakthrough
viruses. Growth-conditioned cell culture supernatants
were collected from 293s cells transiently transfected
with plasmids directing the expression of breakthrough
virus envelope glycoproteins. The relative rgp120
concentrations were determined by ELISA using MAb 5B6
specific for the HSV-1 glycoprotein D flag epitope at
the amino terminus of all of the rgp120 variants
described herein. The resulting rgp120 preparations
were captured onto wells of microtiter plates coated
with a polyclonal antibody specific for a conserved
sequence in the C-terminus of gp120. The binding of
virus neutralizing monoclonal antibodies reactive with
gp120 was determined by ELISA. A, binding by MAb (5B6)
specific for the HSV-1 glycoprotein D flag epitope; B,
binding by MAb (1034) against the V3 domain of
MN-rgp120; C, binding by MAb (50.1) raised against a
synthetic peptide corresponding to the V3 domain of
MN-rgp120; D, binding by a human MAb (15e) known to
block the binding of gp120 to CD4.

Figure 6 depicts the mature envelope glycoprotein
(gp120) from the MN clone of the MN strain of HIV-1
(SEQ. ID. NO. 29). Hypervariable domains are indicated
in bold, and the V and C regions are indicated
(according to Modrow et al., J. Virology 61(2):570
(1987)). Potential glycosylation sites are marked with
a (*).

Detailed Description of the Invention
The present invention provides gp120 polypeptides
from breakthrough isolates of HIV vaccine trials.
Novel oligonucleotide sequences encoding gp120 from
breakthrough isolates which can be used to express
gp120 are also provided. Use of gp120 polypeptides
from one or more of the isolates in a subunit vaccine,
usually together with MN-rgp120, can provide protection
against HIV strains that are sufficiently different
from the vaccine strain (e.g., MN-rgp120) that the
vaccine does not confer protection against those
strains.

In one embodiment, the vaccine is based on the use
of the MN-rgp120 polypeptide (Sequence ID No. 29) and
gp120 polypeptides from MN-like viruses that include
neutralizing epitopes that are not present in the
initial vaccine strain, and are sufficiently different
from those of the vaccine strain, to have been able to
cause HIV-1 infections in MN-rgp120 vaccinated
individuals (i.e., to result in breakthrough
infections). Use of the initial vaccine strain
empirically determines the viruses present in the
population that contain additional neutralizing
epitopes sufficiently different from those of the
vaccine strain to escape protection induced by the
vaccine strain. Use of an initial representative gp120
polypeptide in a vaccine acts as a sieve so that
viruses that are not effectively protected against by
the vaccine strain break through the vaccine,
empirically resulting in determination of additional
strains in a given geographic region that are not
protected against by the initial vaccine strain. Use
of gp120 from those breakthrough isolates complements
the vaccine isolate by providing additional
neutralizing epitopes not present in the initial
vaccine strain, therefore creating a more complete
vaccine that confers protection against multiple
different virus strains in the region.

Prior HIV-1 vaccine strategies were based on
selection of appropriate candidate vaccine polypeptides
based on homology alignment studies. However, since
some of the neutralizing epitopes are conformation-
dependent and the location of all of these epitopes is
not known, this approach necessarily cannot determine
all of the neutralizing epitopes that should be
included in a vaccine for a particular region. In
contrast, the present approach uses a selected
representative strain and empirically determines
strains that are sufficiently different and therefore
break through the barrier of protection provided by the
initial vaccination program. Those strains can be
included in the vaccine to confer more complete
protection from HIV strains in the region. In
addition, those strains can be used alone to confer
protection against the breakthrough virus.

In another embodiment, the invention comprises a
vaccine containing a first HIV gp120 polypeptide
sequence and a breakthrough isolate HIV gp120
polypeptide sequence from a vaccinee vaccinated with a
vaccine including the first HIV gp120 polypeptide
sequence, the HIV gp120 polypeptide sequences being in
a suitable carrier. Fragments of one or both HIV gp120
polypeptide sequences can be substituted for one or
both of the corresponding HIV gp120 polypeptide
sequences.

Preferably, the first gp120 polypeptide sequence
contains neutralizing epitopes found in one or more
gp120 polypeptides present in isolates from the
geographical region where the initial vaccine (i.e.,
the vaccine that gives rise to the breakthrough
isolate) is administered. More preferably, the first
gp120 polypeptide sequence contains at least one of the
more common neutralizing epitopes for the region, and
most preferably the first gp120 polypeptide sequence
contains at least one of the three most common
neutralizing epitopes.

gp120 polypeptide sequences suitable for use as
the first gp120 polypeptide sequence include gp120 MN,
the Thai isolate A244 sequence (hereinafter "gp120
A244"), gp120 MN-GNE6 (Sequence ID No. 31; also known
in the art as "gp120 GNE6"), and gp120 MN-GNE8
(Sequence ID No. 33; also known in the art as "gp120
GNE8"), and the like. gp120 MN, gp120 MN-GNE6, and
gp120 MN-GNE8 are especially preferred for use as the
first gp120 polypeptide sequence in initial vaccines
for North America. gp120 A244 is especially preferred
for use as the first gp120 polypeptide sequence in
initial vaccines for Thailand.

In a variation of this embodiment, the vaccine
includes two different (i.e., first and second) gp120
polypeptide sequences, or fragments thereof, in
combination with a breakthrough isolate HIV gp120
polypeptide sequence. The latter can be from a
vaccinee vaccinated with either or both of the first
and second HIV gp120 polypeptide sequences.

Exemplary vaccines include those containing
combinations of gp120 MN, gp120 A244, gp120 MN-GNE6
(Sequence ID No. 31), and gp120 MN-GNE8 (Sequence ID
No. 33). Combinations of gp120 MN and gp120 A244 or
gp120 MN-GNE8 (Sequence ID No. 33) with a breakthrough
isolate HIV gp120 polypeptide sequence are especially
preferred.

In vaccines containing gp120 MN, the breakthrough
isolate HIV gp120 polypeptide sequence can be an HIV
gp120 polypeptide sequence selected from the group
consisting of Sequence ID Nos. 2, 4, 6, 8, 10, 12, 14,
16, 18, 20, 22, 24, 26, and 28, and fragments thereof.

The term "subunit vaccine" is used herein, as in
the art, to refer to a viral vaccine that does not
contain virus, but rather contains one or more viral
proteins or fragments of viral proteins. As used
herein, the term "multivalent" means that the vaccine
contains gp120 from at least two HIV isolates having
different amino acid sequences.

The term "breakthrough isolate" or "breakthrough
virus" is used herein, as in the art, to refer to a
virus isolated from a vaccinee.

The terms "amino acid sequence", "polypeptide
sequence", and "polypeptide" are used interchangeably
herein as in the art, as are the terms "nucleic acid
sequence", "nucleotide sequence", and
"oligonucleotide".

Polypeptides from Breakthrough Isolates 

The gp120 polypeptides of this invention
correspond to the amino acid sequences of seven
breakthrough isolates which are illustrated below in
Table 1. A polypeptide of this invention includes an
HIV gp120 amino acid sequence illustrated in Table 1
(Sequence ID Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 19,
21, 23, 25, and 27) and fragments thereof. The
polypeptides of this invention can include fused
sequences from two or more HIV gp120 or gp160 amino
acid sequences.

The polypeptide can also be joined to another
viral protein, such as a flag epitope amino acid
sequence. The term "flag epitope" is used herein, as
in the art, to denote an amino acid sequence that
includes an epitope recognized by a monoclonal
antibody. Flag epitopes facilitate using single
monoclonal antibody affinity purification of a
plurality of different recombinant proteins, each
having the flag epitope recognized by the monoclonal
antibody. Numerous amino acid sequences can function
as flag epitopes. The N-terminal sequence of Herpes
Simplex Virus Type 1 (HSV-1) glycoprotein D (gD-1) is
conveniently used as the flag epitope and its use is
described in detail in the examples. The flag epitope
is conveniently fused to the N terminus of the HIV
gp120 polypeptide sequence. Alternatively, however,
monoclonal antibodies that recognize neutralizing
epitopes in the rgp120 sequences can be used to
affinity purify the amino acid sequences, and a flag
epitope can be omitted.

In addition, various signal sequences can be
joined to a polypeptide of this invention. Although
rgp120 is secreted to some extent in HIV cultures, the
amount of the envelope glycoprotein released from
(secreted by) the host cells varies widely from strain
to strain. Various signal sequences can be introduced
into the polypeptide by joining a nucleotide sequence
encoding the signal sequence to the nucleotide sequence
encoding the rgp120 to facilitate secretion of rgp120
from the cells. For example, Chiron HIV gp120
polypeptides include a signal sequence from tissue
plasminogen activator (TPA) that provides good
secretion of rgp120. Additional signal sequences are
well known and include the N-terminal domain of murine
leukemia virus surface protein gp70 described by Kayman
et al., J. Virol. 68:400-410 (1994).

Table 1 illustrates the nucleotide and deduced
amino acid sequences for two clones of each of the seven
breakthrough isolates of this invention. The clones
are: C6.1; C6.5; C8.3; C8.6; C15.2; C15.3; C7.2;
C7.10; C11.5; C11.7; C10.5; C10.7; C17.1; and C17.3.
These sequences are SEQ. ID. NOs. 1-28, the first
sequence number for each clone being the nucleotide
sequence and the second being the amino acid sequence.
The amino acid sequence for MN and the nucleotide and
deduced amino acid sequences for MN-GNE6 and MN-GNE8
are illustrated in the sequence listing hereinafter.
In the listing for MN-GNE6, a stop codon appears at
amino acid residue position 51. This stop codon can be
replaced with a codon encoding the corresponding amino
acid from MN or MN-GNE8 or another isolate.
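
To illustrate how a deduced amino acid row in Table 1 relates to the nucleotide row above it, the following sketch translates a row of codons with the standard genetic code. The function name `translate_row` and the choice to print `Xaa` for stop codons and for codons containing an ambiguous base (mirroring how the listing prints the TAA and ANT codons) are assumptions made for this example, not part of the disclosure. The sample row is the codon line of clone C6.5 that ends at nucleotide 114.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), ONE_LETTER)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate_row(codon_row):
    """Translate space-separated codons into three-letter residue codes.

    Stop codons and codons with non-ACGT characters are reported as Xaa
    (an assumption that follows the convention seen in the listing).
    """
    residues = []
    for codon in codon_row.upper().split():
        one_letter = CODON_TABLE.get(codon, "*")
        residues.append(THREE_LETTER.get(one_letter, "Xaa"))
    return " ".join(residues)


if __name__ == "__main__":
    row = "CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC"
    print(translate_row(row))
```

Comparing the printed row with the amino acid line in the table is a quick way to spot transcription errors in either line.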
TABLE 1
CLONE C6.1

CGG GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA 36
Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1               5                   10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG 75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
        15                  20                  25

CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC 114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
                30                  35

CCA AAC CCA CAA GAA ATG GTA TTG GAA AAT GTG ACA GAA 153
Pro Asn Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu
    40                  45                  50

GAT TTT ??? ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met
            55                  60

CAT GAG GAT ATA ATC AGT TTA TGG GAT CAA AGC CTA AAA 231
His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys
65                  70                  75

CCA TGT GTA AAA TTA ACC CCA CTC TGT ATT ACT TTA AAT 270
Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
        80                  85                  90

TGC ACC AAT TGG AAG AAG AAT GAT ACT AAA ACT AAT AGT 309
Cys Thr Asn Trp Lys Lys Asn Asn Thr Lys Thr Asn Ser
                95                  100

AGT AGT ACT ACA ACT AAT AAT AGT AGT GCT ACA GCT AAT 348
Ser Ser Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn
    105                 110                 115

AGT AGT AGT ACT ACA ACT AAT AGT AGT TGG GCA GAG ATA 387
Ser Ser Ser Thr Thr Thr Asn Ser Ser Trp Gly Glu Ile
            120                 125

AAG GAG GGA GAA ATA AAG AAC TGC TCT TTC AAT ATC ACC 426
Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
130                 135                 140

ACA AGC ATA AGA GAC AAG GTG AAG AAA GAA TAT GCA CTT 465
Thr Ser Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala Leu
        145                 150                 155

TTT TAT AGC CTT GAT GTA GTA CCA ATA GAA AAT GAT AAT 504
Phe Tyr Ser Leu Asp Val Val Pro Ile Glu Asn Asp Asn
                160                 165

ACT AGC TAT AGG TTG AGA AGT TGT AAC ACC TCA GTC ATT 543
Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val Ile
    170                 175                 180

ACA CAA GCC TGT CCA AAG GTA ACT TTT GAG CCA ATT CCC 582
Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro
            185                 190

ATA CAT TAT TGT ACC CCG GCT GGT TTT GCG ATT CTG AAG 621
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195                 200                 205

TGT AGA GAT AAA AAG TTC AAT GGA ACA GGA CCA TGC AAA 660
Cys Arg Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
        210                 215                 220

AAT GTT AGC ACA GTA CAA TGT GCA CAT GGA ATT AAG CCA 699
Asn Val Ser Thr Val Gln Cys Ala His Gly Ile Lys Pro
                225                 230

GTA GTG TCA ACT CAA CTG CTG TTA AAT GGC AGC CTA GCA 738
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
    235                 240                 245

GAA GAA GAG GTA ATA ATT AGA TCT GCC AAT TTC TCA AAC 777
Glu Glu Glu Val Ile Ile Arg Ser Ala Asn Phe Ser Asn
            250                 255

AAT ??? AAA ??? ATA ATA GTA CAG TTG AGG GAA CCT GTA 816
Asn ??? Lys Ile Ile Ile Val Gln Leu Arg Glu Pro Val
260                 265                 270

GAA ATT AAT TGT ??? AGA ??? AGC AAC AAT ACA ATA AAA 855
Glu Ile Asn Cys ??? Arg Pro Ser Asn Asn Thr Ile Lys
        275                 280                 285

GGT ATA CAC ATA GGA CCA GGG AGA GCA TTT TAT GCA ACA 894
Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr
                290                 295

GGA GAC ATA CCA GGA GAT ATA AGA CAA GCA CAT TGT AAC 933
Gly Asp Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn
    300                 305                 310

ATT AGT GGA GCA AAA TGG AAT AAC ACT TTA AAC AAG GTA 972
Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu Lys Lys Val
            315                 320

GTT AAA AAA TTA AAA GAA CAA TTT CCA AAT AAA ACA ATA 1011
Val Lys Lys Leu Lys Glu Gln Phe Pro Asn Lys Thr Ile
325                 330                 335

GTC TTT AAC CAT TCC TCA GGA GGG GAC CCA GAA ATT GTA 1050
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val
        340                 345                 350

ATG CAC AGT TTT AAT TGT CAA GGG GAA TTT TTC TAC TGT 1089
Met His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys
                355                 360

AAT ACA ACA AAG CTG TTT AAT AGT ACT TGG AAT GAT ACT 1128
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp Thr
    365                 370                 375

??? GAG TCA AAT AAC AAT GAT AGT ACT ATT ACA CTC CCA 1167
Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro
            380                 385

TGC AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA ATA 1206
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Ile
390                 395                 400

GGA AAA GCA ATG TAT GCC CCT CCC ACC AGA GGA GAA ATT 1245
Gly Lys Ala Met Tyr Ala Pro Pro Thr Arg Gly Glu Ile
        405                 410                 415

AAA TGT TCA TCA AAT ATT ACA GGA CTA CTG TTA ATA ??? 1284
Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Ile Arg
                420                 425

GAT GGT GGT ATT AAC ACT AGC GAT GCC ACC GAG ACC TTC 1323
Asp Gly Gly Ile Asn Thr Ser Asp Ala Thr Glu Thr Phe
    430                 435                 440

AGA CCG GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT 1362
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
            445                 450

GAA TTA TAT AAA TAT AAA GTA GTG AAA ATT GAG CCA TTA 1401
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
455                 460                 465

GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG 1440
Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
        470                 475                 480

AGA GAA AAA AGA GCA GTA ACA CTA GGA GCT ATG TTC CTT 1479
Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu
                485                 490

GGG TTC TTA GGA GCA TAA AGC TTC 1503
Gly Phe Leu Gly Ala Xaa Ser Phe
    495                 500 501




























CLONE C6.5 








GGG GTA CCT GTA TGG AAA GAA GCA ACC ACC ACT CTA 36
Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1 5 10




TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG 75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
15 20 25








WO 98/01564 



PCT/US97/09690 



10 
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30 



40 



45 



50 



55 



60 



CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC 114 
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp 

30 35 
CCA AAC CCA CAA GAA ATG GTA TTG GAA AAT GTG ACA GAA 153
Pro Asn Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu
40 45 50

GAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met
55 60

CAT GAG ANT ATA ATC AGT TTA TGG GAT CAA AGC CTA AAA 231
His Glu Xaa Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys



65 

CCA TGT GTA 
Pro Cys Val 
80 

TGC ACC 
Cys Thr 



AGT AGT 
Ser Ser
105 
AGT AGT 
Ser Ser 



AAA TTA 
Lys Leu 

TGG AAG 
Trp Lys 
95 

ACT ACA ACT 
Thr Thr Thr 



AAT 
Asn 



70 
ACC 
Thr 

GAG 
Glu 

AAT 
Asn 



CCA CTC TGT 
Pro Leu Cys 
85 

AAT GAT ACT 
Asn Asp Thr 



ATT 
Ile



75 
ACT 
Thr 



TTA AAT 
Leu Asn 

90 
AAT AGT 
Asn Ser 



AGT ACT ACA ACT 
Ser Thr Thr Thr 
120 

AAG GAG GGA GAA ATA 
Lys Glu Gly Glu Ile
130 



ACA GGC 
Thr Gly 



TTT TAT 
Phe Tyr 



ATA 
Ile
145 
AGC 
Ser 



AGA GAC 
Arg Asp 



AAG 
Lys 
135 
AAG 
Lys 



AAT 
Asn 
110 
AAT 
Asn 

AAC 
Asn 

GTG 
Val 



CTT GAT GTA GTA 
Leu Asp Val Val 
160 

ACT AGC TAT AGG TTG AGA AGT 



AAA ACT 
Lys Thr 
100 

GCT ACA GCT AAT 
Ala Thr Ala Asn 
115 

AGT AGT TGG GGA GAG ATA 
Ser Ser Trp Gly Glu Ile
125 

TTC AAT ATC ACC 
Phe Asn Ile Thr
140 

GAA TAT GCA CTT
Glu Tyr Ala Leu
155 

CCA ATA GAA AAT GAT AAT 
Pro Ile Glu Asn Asp Asn
165 

TGT AAC ACC TCA GTC ATT 



AGT AGT 
Ser Ser 



TGC TCT 
Cys Ser 

AAG AAA 
Lys Lys 
150 



Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val Ile



170 175 
ACA CAA GCC TGT CCA AAG GTA ACT TTT GAG CCA 
Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro

185 190 
ATA CAT TAT TGT ACC CCG GCT GGT TTT GCG ATT 
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile
195 200 205 

TGT AAA GAT AAA AAG TTC AAT GGA ACA GGA CCA 
Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro 



180 

ATT CCC 
Ile Pro



270 



309 



348 



387 



426 



465 



504 



543 



582 



AAT GTT 
Asn Val 

GTA GTG 
Val Val 
235 
GAA GAA 
Glu Glu 



210 
AGC ACA GTA 
Ser Thr Val 
225 

TCA ACT CAA 
Ser Thr Gln



CAA TGT 
Gln Cys



CTG 
Leu 



AAT GCT 
Asn Ala 
260 

GAA ATT 



GAG GTA ATA ATT 
Glu Val Ile Ile

250 
AAA ATC ATA 
Lys Ile Ile



CTG 
Leu 
240 
AGA 
Arg 



215 

ACA CAT GGA ATT 
Thr His Gly Ile
230 

TTA AAT GGC AGC 
Leu Asn Gly Ser 



CTG 
Leu 

TGC 
Cys 

AAG 
Lys 



AAG 
Lys 

AAA 
Lys 
220 
CCA 
Pro 



TCT GCC 
Ser Ala 
255 

ATA GTA CAG TTG 
Ile Val Gln Leu
265 

AAT TGT ACA AGA CCC AGC AAC 
Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr

275 280 
GGT ATA CAC ATA GGA CCA GGG AGA GCA TTT TAT 
Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr
290 295 



AAT TTC 
Asn Phe 

AAG GAA 
Lys Glu 
27 

AAT ACA 



CTA GCA 
Leu Ala 
245 

TCA AAC 
Ser Asn 

CCT GTA 
Pro Val 

ATA AAA 
Ile Lys
285 
GCA ACA
Ala Thr 



621 



660 



699 



738 



777 



816 



855 



894 
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GGA 


GAC 


ATA 


CGA 


GGA 


GAT 


ATA 


AGA 


CAA 


GCA 


CAT 


TGT 


AAC 




Gly 


Asp 


Ile


Arg 


Gly 


Asp 


Ile


Arg 


Gln


Ala 


His 


Cys 


Asn 




300 










305 










310 






ATT AGT GGA GCA AAA TGG AAT AAC ACT TTA AAG AAG GTA
Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu Lys Lys Val
315 320












GTT ATA AAA TTA AAA GAA CAA TTT CCA AAT AAA ACA ATA
Val Ile Lys Leu Lys Glu Gln Phe Pro Asn Lys Thr Ile
325 330 335






GTC TTT AAC CAT TCC TCA GGA GGG GAC CCA GAA ATT GTA
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350




ATG CAC AGT TTT AAT TGT CAA GGG GAA TTT TTC TAC TGT
Met His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys
355 360










AAT ACA ACG AAG CTG TTT AAT AGT ACT TGG AAT GAT ACT
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp Thr
365 370 375






ACA GAG TCA AAT AAC AAT GAT AGT ACT ATT ACA CTC CCA
Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro
380 385












TGC AGA ATA AAA CAA ATT ATA AAC ATG TGG ran GAA GTA
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val
390 395 400






GGA AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA GAA ATT
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile
405 410 415




AAA TGT TCA TCA AAT ATT ACA GGA CTA CTG TTA ACA AGA
Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
420 425










GAT GGT GGT ATT AAC ACT AGC GAT GCC ACC GAG ACC TTC
Asp Gly Gly Ile Asn Thr Ser Asp Ala Thr Glu Thr Phe
430 435 440






AGA CCG GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
445 450












GAA TTA TAT AAA TAT AAA GTA GTG AAA ATT GAG CCA TTA
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
455 460 465






GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG
Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
470 475 480




AGA GAA AAA AGA GCA GTA ACA CTA GGA GCT ATG TTC CTT
Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu
485 490










GGG TTC TTG GGA GCA TAA AGC TTC 1503
Gly Phe Leu Gly Ala Xaa Ser Phe
495 500 501




























CLONE C8.3 










G GTA CCT GTA TGG AAA GAA GCA ACC ACC ACT CTA TTT
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10








TGT GCA TCA GAT GCT AAA GCA TAT GAT ACA GAG GTA CAT
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His
15 20 25




AAT GTT TGG GCT ACA CAT GCC TGT GTA CCC ACA GAC CCC
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35








AAC CCA CAA GAA GTA GTA TTG GAA AAT GTA ACA GAA AAT
Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
40 45 50






TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAT
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60











933 

972 

1011 

1050 

1089 

1128 

1167 

1206 

1245 

1284 

1323 

1362 

1401 

1440 

1479 



37 

76 

115 

154 

193 
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GAG GAT ATA ATC ACT TTA TGG GAT CAA AGT CTA AAG CCA 232 

Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

ACT AAT TTG GAG AAT GCT AAT AAT ACC GAG AAT GCT AAT 310
Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn
95 100

AAT ACC AAT AAT TAT ACC TTG GGG ATG GAG AGA GGT GAA 349
Asn Thr Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu
105 110 115

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA AGC TTA AGA 388
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Leu Arg
120 125

GAT AAG GTG AAA AAA GAA TAT GCA TTG TTT TAT AAA CTT 427
Asp Lys Val Lys Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
130 135 140

GAT GTA GTA CAA ATA GAT AAT AGT ACC AAC TAT AGG CTG 466
Asp Val Val Gln Ile Asp Asn Ser Thr Asn Tyr Arg Leu
145 150 155

ATA AGT TGT AAT ACC TCA GTC ATT ACA CAG GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165

AAG GTA TCC TTT GAG CTA ATT CCC ATA CAT TAT TGT GCC 544
Lys Val Ser Phe Glu Leu Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys
185 190

TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGC ACA GTA 622
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
195 200 205

CAA TGT ACA CAT GGA ATT AGA CCA GTA GTA TCA ACT CAA 661
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220

CTA CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG ATA GTA 700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val
225 230

ATT AGA TCT GAA AAT ATC ACA GAC AAT GCT AAA ACC ATA 739
Ile Arg Ser Glu Asn Ile Thr Asp Asn Ala Lys Thr Ile
235 240 245

ATA GTG CAG CTA AAT GAA TCT ATA GTG ATT AAT TGT ACA 778
Ile Val Gln Leu Asn Glu Ser Ile Val Ile Asn Cys Thr
250 255

AGA CCC AAT AAC AAC ACA AGA AAA AGT ATA AAT ATA GGA 817
Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn Ile Gly
260 265 270

CCA GGG AGA GCA TTC TAT ACA ACA GGA GAC ATA ATA GGA 856
Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

GAT ATA AGA CAA GCA CAT TGT AAC CTT AGT AAA ACA CAA 895
Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln
290 295

TGG GAA AAA ACG TTA AGA CAG ATA GCT ATA AAA TTA GAA 934
Trp Glu Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu
300 305 310

GAA AAA TTT AAG AAT AAA ACA ATA GCC TTT AAT AAA TCC 973
Glu Lys Phe Lys Asn Lys Thr Ile Ala Phe Asn Lys Ser
315 320

TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT AAT 1012
Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn
325 330 335
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Tip 
X its 
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He 
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He 
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ID 
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Leu 
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v»A 1 


Vj*J I 
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GGT 
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*»i x y 
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Asn 


Thr 
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vj x y 
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i nr 


r l ii 
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TTT 
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%*V* X 


nc a 
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GGA 
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ATG 


AGG 


GAC 


AAT 


TGG 


AGA 


1324 




Phe 


Arg 


Pro 




Gly


Gly


Asp


Met 


Arg 


Asp 


Asn 


Trp 


Arg 








430








435 










440 






25 


AGT 


GAA 


TTA 


TAT 


AAA 


TAT 


AAA 


GTA 


GTA 


AGA


ATT 


GAA 


CCA 


1363 




Ser 


Glu 


Leu 


Tyr 


Lys 


Tyr 
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Val 


Val


Arg 


Ile


Glu 


Pro 
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bin 
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CAG 
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GTG 


GTG 


1402 




Leu 




VOX 
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Lys 
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Arg 


Val 


Val 




J u 
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465 
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Onn 


AAA 


AGA
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1441 




Gin 


A rrr 
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Val 
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He 
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Ala 


Met 
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47S 
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Leu 


Gly 


Asp 
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TTT 


37 
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Thr 
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1 
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Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 
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Thr 
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Pro 
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Glu 
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232 
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271 
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60 
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AGA AAA 
Arg Lys 

GAT AAG 
Asp Lys
130 

GAT GTA 
Asp Val 

ATA AGT
Ile Ser

AAG GTA 
Lys Val 
170
CCG GCT 

Pro Ala 

TTC AAT 
Phe Asn
195 

CAA TGT 
Gln Cys

CTA CTG
Leu Leu 

ATT AGA 
Ile Arg
235
ATA GTG 
Ile Val

AGA CCC 
Arg Pro
260 

CCA GGG 
Pro Gly 

GAT ATA
Asp Ile

TGG GAA 
Trp Glu 
300
GAA AAA 
Glu Lys 

TCA GGA 
Ser Gly
325 

TGT GGA 
Cys Gly 

TTT AAT
Phe Asn 

ACC GGG 
Thr Gly 
365
TGC AGA 
Cys Arg 



AAC TGC TCT TTC 
Asn Cys Ser Phe 

120 

GGG AAA AAA GAA 
Gly Lys Lys Glu 
135 

GTA CAA ATA GAT 
Val Gln Ile Asp
145 

TGT AAT ACC TCA 
Cys Asn Thr Ser 
160 

TCC TTT GAG CCA 
Ser Phe Glu Pro 

GGT TTT GCG ATT 
Gly Phe Ala Ile
185 

GGA ACA GGA CCA 
Gly Thr Gly Pro 
200 

ACA CAT GGA ATT 

Thr His Gly Ile
210 

TTA AAT GGC AGT 
Leu Asn Gly Ser 
225 

TCT GAA AAT ATC 
Ser Glu Asn Ile

CAG CTA AAT GAA 
Gln Leu Asn Glu
250 

AAT AAC AAC ACA 
Asn Asn Asn Thr 
265 

AGA GCA TTC TAT 
Arg Ala Phe Tyr 

275 

AGA CAA GCA CAT 
Arg Gln Ala His
290 

AAA ACG TTA AGA 
Lys Thr Leu Arg 

TTT AAG AAT AAA 
Phe Lys Asn Lys 
315 

GGG GAC CCA GAA 
Gly Asp Pro Glu 
330 

GGG GGA TTT TTC 
Gly Gly Phe Phe 
340 

AGT ACC TGG AAT 
Ser Thr Trp Asn 
355 

GAT CGT ACT GAA 
Asp Arg Thr Glu 

ATA AAA CAA ATC 

Ile Lys Gln Ile
380 



AAT ATC ACC ACA 
Asn Ile Thr Thr
125 

TAT GCA TTG TTT 
Tyr Ala Leu Phe 

AAT AGT ACC AAC 
Asn Ser Thr Asn 
150 

GTC ATT ACA CAG 
Val Ile Thr Gln
165 

ATT CCC ATA CAT 
Ile Pro Ile His
175 

CTA AAG TGT AAA 
Leu Lys Cys Lys 
190 

TGT AAA AAT GTC 
Cys Lys Asn Val 

AGA CCA GTA GTA 
Arg Pro Val Val 
215 

CTA GCA GAA GAA 
Leu Ala Glu Glu 
230 

ACA GAC AAT GCT 
Thr Asp Asn Ala 
240 

TCT ATA GTG ATT 
Ser Ile Val Ile
255 

AGA AAA AGT ATA 
Arg Lys Ser Ile

ACA ACA GGA GAC 
Thr Thr Gly Asp 
280 

TGT AAC CTT AGT 
Cys Asn Leu Ser 
295 

CAG ATA GCT ATA 
Gln Ile Ala Ile
305 

ACA ATA GCC TTT 
Thr Ile Ala Phe

320 

ATT GTA ATG CAC 
Ile Val Met His

TAC TGT AGT ACG 
Tyr Cys Ser Thr 
345 

TTA ACA CAA CCG 
Leu Thr Gin Pro 
360 

GAG TTA AAT ATT 
Glu Leu Asn Ile
370 

ATA AAC TTG TGG 
Ile Asn Leu Trp
385 



AGC TTA AGA 388 
Ser Leu Arg 

TAT AAA CTT 427 
Tyr Lys Leu 
140 

TAT AGG CTG 466 
Tyr Arg Leu 
155 

GCC TGT CCA 505 
Ala Cys Pro 

TAT TGT GCC 544 
Tyr Cys Ala 
180 

GAT AAG AAG 583 
Asp Lys Lys 

AGG ACA GTA 622 
Arg Thr Val 
205 

TCA ACT CAA 661 
Ser Thr Gln
220 

GAG ATA GTA 700 
Glu Ile Val

AAA ACC ATA 739
Lys Thr Ile
245 

AAT TGT ACA 778
Asn Cys Thr 

AAT ATA GGA 817 
Asn Ile Gly
270 

ATA ATA GGA 856 
Ile Ile Gly
285 

AAA ACA CAA 895 
Lys Thr Gln

AAA TTA GAA 934 
Lys Leu Glu 
310 

AAT AAA TCC 973 
Asn Lys Ser 

AGT TTT AAT 1012 
Ser Phe Asn 
335 

AGA AAA CTG 1051 
Arg Lys Leu 
350 

TTT AGT AAT 1090 
Phe Ser Asn 

ACA CTC CCA 1129 
Thr Leu Pro 
375 

CAG GAA GTA 1168 
Gln Glu Val
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GGC AAA GCA ATG TAT GCC CCT CCC ATC AG A GGA CAA ATT 1207 

Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile
390 395 400

AGA TGT TCA TCA AAT ATT ACA GGG CTA CTA TTA AGG AGA 1246
Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Arg Arg
405 410 415

GAT GGT GGA AGT AAC ACC AGT GAC AAC CAG ACT GAG ACC 1285
Asp Gly Gly Ser Asn Thr Ser Asp Asn Gln Thr Glu Thr
420 425

TTT AGA CCT GGG GGA GGA GAT ATG AGG GAC AAG TGG AGA 1324
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Lys Trp Arg
430 435 440

AGT GAA TTA TAT AAA TAT AAA GTA GTA AGA ATT GAA CCA 1363
Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro
445 450

TTA GGA GTA GCA CCC ACC CAG GCA AAG AGA AGA GTG GTG 1402
Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg Val Val
455 460 465

CAA AGA GAA AAA AGA GCA GTG GGG ATA GGA GCT ATG TTC 1441
Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

CTT AGG TTC TTA GGA GAT AAA GCT TCT AGA GTC 1474
Leu Arg Phe Leu Gly Asp Lys Ala Ser Arg Val
485 490 491

CLONE C15.2

CTC GAG GTA CCT GTA TGG AAA GAA GCA ACT ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT AAT ACA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu
15 20 25

AAA CAT AAT GTT TGG GCC ACA CAC GCC TGT GTA CCC ACA 114
Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAT CCC AAC CCA CAA GAA GTA GTA TTG GGA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GAA GAT ATA ATC AGT TTA TGG GAT CAA AGT CTA 231
Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT GAT GAT TTA GGG AAT GCT ACT AAT ACC AAT 309
Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn
95 100

AGT AGT GCC ACT ACC AAT AGT AGT AGT TGG GAA GAA ATG 348
Ser Ser Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met
105 110 115

AAG GGG GAA ATG AAA AGA TGC TCT TTC AAT ATC ACC ACA 387
Lys Gly Glu Met Lys Arg Cys Ser Phe Asn Ile Thr Thr

AGC ATA AGA GAT AAG ATT AAG AAA GAA CAT GCA CTT TTC 426
Ser Ile Arg Asp Lys Ile Lys Lys Glu His Ala Leu Phe
130 135 140

TAT AGA CTT GAT GTA GTA CCA ATA GAT AAT GAT AAT ACC 465
Tyr Arg Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr
145 150 155
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ACA TAT AGG TTG ATA AAT TGT AAT ACC TCA GTC ATT ACA 504 
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
160 165

CAG GCC TGT CCA AAG GTA TCA TTT GAG CCA ATT CCC ATA 543
Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
170 175 180

CAT TTT TGT GCC CCG GCT GGT TTT GCC ATT CTA AAG TGT 582
His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
185 190

AAT AAT AAG ACG TTC GAG GGA AAA GGA CCA TGT AAA AAT 621
Asn Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn
195 200 205

GTC AGT ACA GTA CAA TGC ACA CAT GGA ATT AGG CCA GTA 660
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220

GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA 699
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230

GAA GAG GTA ATA ATT AGA TCT GAC AAT ATC ACA GAC AAT 738
Glu Glu Val Ile Ile Arg Ser Asp Asn Ile Thr Asp Asn
235 240 245

ACT AAA ACC ATT ATA GTA CAG CTA AAC GAA TCT GTA GTA 777
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
250 255

ATT AAT TGT ACA AGA CCC AAC AAC AAT ACA AGA AAA AGT 816
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
260 265 270

ATA CAT ATA GGA CCA GGC AGT GCA TTT TTT GCA ACA GGA 855
Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

GAA ATA ATA GGA GAT ATA AGA CAA GCA CAC TGT AAC CTT 894
Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu
290 295

AGT AGA ACA CAA TGG AAT AAC ACT TTA GGA AAG ATA GTC 933
Ser Arg Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val
300 305 310

ATA AAA TTA AGA GAA CAA TTT AGA AAA CAA TTT GGA GAA 972
Ile Lys Leu Arg Glu Gln Phe Arg Lys Gln Phe Gly Glu
315 320

AAA ACA ATA GTC TTT AAT CGA TCC TCA GGA GGG GAC CCG 1011
Lys Thr Ile Val Phe Asn Arg Ser Ser Gly Gly Asp Pro
325 330 335

GAA ATT GCA ATG CAC AGT TTT AAT TGT GGA GGG GAA TTT 1050
Glu Ile Ala Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350

TTC TAC TGT AAC ACA ACA GCA CTG TTT AAT AGT ACC TGG 1089
Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp
355 360

AAT GTT ACT AAA GGG TTG AAT AAC ACT GAA GGA AAT AGC 1128
Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn Ser
365 370 375

ACA GGA GAT GAA AAT ATC ATA CTC CCA TGT AGA ATA AAA 1167
Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys
380 385

CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG 1206
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
390 395 400

TAT GCC CCT CCC ATC AGT GGA CAA ATT AGA TGT TCA TCA 1245
Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser
405 410 415

AAC ATT ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT AGT 1284
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser
420 425



50 
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10 



15 



20 



AAG AAC GAG AGC ATC ACC ACC GAG GTC TTC AGA CCT GGA 1323
Lys Asn Glu Ser Ile Thr Thr Glu Val Phe Arg Pro Gly
430 435 440






GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT 1362
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
445 450












AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCG 1401
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465








CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA 1440
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480




AGA GCA GTG GGA ACA ATA GGA GCT ATG TTC CTT GGG TTC 1479
Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe
485 490










TTG GGA GCA TAA AGC TTC TAG AGT CGA CCT GCA 1512
Leu Gly Ala Xaa Ser Phe Xaa Ser Arg Pro Ala
495 500 504









CLONE C15.3

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACT ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT AAT ACA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu
15 20 25

AAA CAT AAT GTT TGG GCC ACA CAC GCC TGT GTA CCC ACA 114
Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAT CCC AAC CCA CAA GAA GTA GTA TTG GGA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GAA GAT ATA ATC AGT TTA TGG GAT CAA AGT CTA 231
Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT GAT GAT TTA GGG AAT GCT ACT AAT ACC AAT 309
Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn
95 100

AGC AGT GCC ACT ACC AAT AGT AGT AGT TGG GAA GAA ATG 348
Ser Ser Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met
105 110 115

AAG GGG GAA ATG AAA AGG TGC TCT TTC AAT ATC ACC ACA 387
Lys Gly Glu Met Lys Arg Cys Ser Phe Asn Ile Thr Thr
120 125

AGC ATA AGA GAT AAG ATT AAG AAA GAA CAT GCA CTT TTC 426
Ser Ile Arg Asp Lys Ile Lys Lys Glu His Ala Leu Phe
130 135 140

TAT AGA CTT GAT GTA GTA CCA ATA GAT AAT GAT AAT ACC 465
Tyr Arg Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr
145 150 155

ACA TAT AGG TTG ATA AAT TGT AAT ACC TCA GTC ATT ACA 504
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
160 165

CAG GCC TGT CCA AAG GTA TCA TTT GAG CCA ATT CCC ATA 543
Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
170 175 180

CAT TTT TGT GCC CCC GCT GGT TTT GCG ATT CTA AAG TGT 582
His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
185 190
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AAT AAT AAG ACC TTC GAG GGA AAA GGA CCA TGT AAA AAT 621 

Asn Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn
195 200 205

GTC AGT ACA GTA CAA TGC ACA CAT GGA ATT AGG CCA GTA 660
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220

GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA 699
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230

GAA GAG GTA ATA ATT AGA TCT GGC AAT ATC ACA GAC AAT 738
Glu Glu Val Ile Ile Arg Ser Gly Asn Ile Thr Asp Asn
235 240 245

ACT AAA ACC ATT ATA GTA CAG CTA AAC GAA TCT GTA GTA 777
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
250 255

ATT AAT TGT ACA AGA TCC AAC AAC AAT ACA AGA AAA AGT 816
Ile Asn Cys Thr Arg Ser Asn Asn Asn Thr Arg Lys Ser
260 265 270

ATA CAT ATA GGA CCA GGC AGT GCA TTT TTT GCA ACA GGA 855
Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

GAA ATA ATA GGA GAT ATA AGA CAA GCA CAC TGT AAC CTT 894
Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu
290 295

AGT AGA ACA CAA TGG AAT AAC ACT TTA GGA AAG ATA GTC 933
Ser Arg Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val
300 305 310

ATA AAA TTA AGA GAA CAA TTT AGA AAA CAA TTT GGA GAA 972
Ile Lys Leu Arg Glu Gln Phe Arg Lys Gln Phe Gly Glu
315 320

AAA ACA ATA GTC TTT AAT CGA TCC TCA GGA GGG GAC CCG 1011
Lys Thr Ile Val Phe Asn Arg Ser Ser Gly Gly Asp Pro
325 330 335

GAA ATT GCA ATG CAC AGT TTT AAT TGT GGA GGG GAA TTT 1050
Glu Ile Ala Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350

TTC TAC TGT AAC ACA ACA GCA CTG TTT AAT AGT ACC TGG 1089
Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp

AAT GTT ACT AAA GGG TTG AAT AAC ACT GAA GGA AAT AGC 1128
Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn Ser
365 370 375

ACA GGG GAT GAA AAT ATC ATA CTC CCA TGT AGA ATA AAA 1167
Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys
380 385

CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG 1206
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
390 395 400

TAT GCC CCT CCC ATC AGT GGA CAA ATT AGA TGT TCA TCA 1245
Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser
405 410 415

AAT ATT ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT AGT 1284
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser
420 425

AAG AAC GAG AGC ATC ACC ACC GAG GTC TTC AGA CCT GGA 1323
Lys Asn Glu Ser Ile Thr Thr Glu Val Phe Arg Pro Gly
430 435 440

GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT 1362
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
445 450

AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCG 1401
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465
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Thr Asp 
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TTC AAT 389 
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GAA TAT 428 
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GAT AAT 467 
Asp Asn 
155 
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ACT CCC 54 5 
Thr Pro 
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60 



1052 
1091 



GAA GAA GAG GTG GTA ATT AGA TCT GAC AAT TTC ATA GAC 740
Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Ile Asp
235 240 245

AAT ACT AAA ACC ATA ATA GTA CAG CTG AAA GAA TCT GTA 779
Asn Thr Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val
250 255

GAA ATT AAT TGT ATA AGA CCC AAC AAT AAT ACA AGA AAA 818
Glu Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys
260 265 270

GGT ATA CAT ATA GGA CCA GGG AGA GCA TGG TAT GCA ACA 857
Gly Ile His Ile Gly Pro Gly Arg Ala Trp Tyr Ala Thr
275 280 285

GGA GAA ATA GTA GGA GAT ATA AGA AAG GCA TAT TGT AAC 896
Gly Glu Ile Val Gly Asp Ile Arg Lys Ala Tyr Cys Asn
290 295

ATT AGT AGA ACA AAA TGG AAT AAC ACT TTA ATA CAG ATA 935
Ile Ser Arg Thr Lys Trp Asn Asn Thr Leu Ile Gln Ile
300 305 310

GCT AAC AAA TTA AAA GAA AAA TAT AAT ACA ACA ATA AGC 974
Ala Asn Lys Leu Lys Glu Lys Tyr Asn Thr Thr Ile Ser
315 320

TTT AAT CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ACG 1013
Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Thr
325 330 335

CAT AGT TTT AAT TGT GGA GGG GAG TTT TTC TAC TGT GAT
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp
340 345 350

TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT TTA AAT GGT
Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Leu Asn Gly
355 360

ACT TGG AAT TTT ACT GCA GGG TCA AAT GAA ACT GAA GGC 1130
Thr Trp Asn Phe Thr Ala Gly Ser Asn Glu Thr Glu Gly
365 370 375

- R ** T ACA CTC CCA TGC AGA ATA AAA CAA ATT ATA AAC 1169 

Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn
380 385

AGG TGG CAG GAA GTA GGG AAA GCA ATG TAT GCC CCT CCC 1208
Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
390 395 400

ATC AGT GGA CAA ATA AAA TGC TCA TCA AAC ATT ACA GGG 1247
Ile Ser Gly Gln Ile Lys Cys Ser Ser Asn Ile Thr Gly
405 410 415

ATG ATA TTA ACA AGG GAT GGT GGT AAC GAG AAC AAT AAT 1286
Met Ile Leu Thr Arg Asp Gly Gly Asn Glu Asn Asn Asn
420 425

GAG AGC AGT ACT ACT GAG ACC TTC AGA CCG GGA GGA GGA 1325
Glu Ser Ser Thr Thr Glu Thr Phe Arg Pro Gly Gly Gly
430 435 440

GAT ATG AGG AAC AAT TGG AGA AGT GAA TTA TAT AAA TAT 1364
Asp Met Arg Asn Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
445 450

AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC 1403
Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
455 460 465

AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA AGA GCA 1442
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala
470 475 480

GTG GGA GCG CTA GGA GCT ATG TTC CTT GGG TTC TTA GGA 1481
Val Gly Ala Leu Gly Ala Met Phe Leu Gly Phe Leu Gly
485 490

GCA TAA AGC TTC TAG ACC GAC TCT AGA GGA TCC 1514
Ala Xaa Ser Phe Xaa Thr Asp Ser Arg Gly Ser
495 500 504
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CLOWE C7.10 

G GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA TTC 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AGA GCA TAT GAC ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Arg Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC CCT 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AGT CCA CAA GAA GTA TTT TTG GGA AAT GTG ACA GAA AAT 154
Ser Pro Gln Glu Val Phe Leu Gly Asn Val Thr Glu Asn
40 45 50

TTT AAT ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG TAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Tyr
55 60

GAG GAT ATA ATT AGT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

AGT GAT TAT AGG AAT GCT ACT GAT TAT AAG AAT GCT ACT 310
Ser Asp Tyr Arg Asn Ala Thr Asp Tyr Lys Asn Ala Thr
95 100

GAT ACC ACT AGT AGT AAC GAG GGA AAG ATG GAG AGA GGA 349
Asp Thr Thr Ser Ser Asn Glu Gly Lys Met Glu Arg Gly
105 110 115

GAA ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA AGC ATA 388
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile
120 125

AAA AAT AAG ATG CAG AAA GAA TAT GCA CTT TTC TAT AAA 427
Lys Asn Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys
130 135 140

3 5 CTT AAT ATA GTA CCA ATA GAT AAT ACA ACC TAT ACA TTG 466 

Leu Asn He Val Pro He Asp Asn Thr Ser Tyr Thr Leu 
145 150 155 

ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCC TGT CCA 505 
He Ser Cys Asn Thr Ser Val He Thr Gin Ala Cys Pro 
40 160 165 

AAG GTA TCC, TTT GAA CCA ATT CCC ATA CAT TAT TGT GCT 544 
Lys Val Ser Phe Glu Pro He Pro He His Tyr Cys Ala 

170 175 180 

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG AAG 583 
45 Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asp Lys Lys 
185 190 
TTC AGT GGA AAA GGA GAA TGT AAA AAT GTC AGC ACA GTA 622 
Phe Ser Gly Lys Gly Glu Cys Lys Asn Val Ser Thr Val 
195 200 205 

50 CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA 661 
Gin Cys Thr His Gly He Arg Pro Val Val Ser Thr Gin 
210 215 220 

CTG CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG GTG GTA 700 
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
55 225 230 

ATT AGA TCT GAC AAT TTC ACA GAC AAT ACT AAA ACC ATA 739 
lie Arg Ser Asp Asn Phe Thr Asp Asn Thr Lys Thr He 

235 240 245 

ATA GTA CAG CTG AAA GAA TCT GTA GAA ATT AAT TGT ATA 7 78 
60 lie Val Gin Leu Lys Glu Ser Val Glu lie Asn Cys He 
250 255 
AGA CCC AAC AAT AAT ACA AGA AAA GGT ATA CAT ATA GGA 817 
Arg Pro Asn Asn Asn Thr Arg Lys Gly He His He Gly 
260 265 270 
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CCA GGG AGA GCA TGG TAT GCA ACA GGA GAA ATA GTA CGA 856 
Pro Gly Arg Ala Trp Tyr Ala Thr Gly Glu He Val Gly 
275 280 285 

GAT ATA AGA CAG GCA TAT TGT AAC ATT ACT AGA ACA AAA 895 
5 Asp He Arg Gin Ala Tyr Cys Asn lie Ser Arg Thr Lys 

290 295 
TGG AAT AAC ACT TTA ATA CAG ATA GCT AAC AAA TTA AAA 934 
Trp Asn Asn Thr Leu He Gin He Ala Asn Lys Leu Lvs 
300 305 310 

10 GAA AAA TAT AAT ACA ACA ATA AGC TTT AAT CGA TCC TCA 97 3 
Glu Lys Tyr Asn Thr Thr lie Set Phe Asn Arg Ser Ser 

315 320 
GGA GGG GAC CCA GAA ATT GTA ACC CAT AGT TTT AAT TGT 1012 
Gly Gly Asp Pro Glu lie Val Thr His Ser Phe Asn Cys 
15 325 330 335 

GGA GGG GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT 1051 
Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe 
3 <*0 345 350 

AAT AGT ACT TGG AAT TTA AAT GGT ACT TGG AAT TTT ACT 1090 
Asn Ser Thr Trp Asn Leu Asn Gly Thr Trp Asn Phe Thr 

355 360 
GCA GGG TCA AAT GAA ACT GAA GGC AAT ATC ACA CTC CCA H 29 
Ala Gly Ser Asn Glu Thr Glu Gly Asn He Thr Leu Pro 
365 370 375 

25 TGC AGA ATA AAA CAA ATT ATA AAC AGG TGG CAG GAA GTA 1158 
Cys Arg He Lys Gin He He Asn Arg Trp Gin Glu Val 

380 385 
GGA AAA GCA ATG TAT GCC CCT CCC ATC AGT GGA CAA ATA 1207 
Gly Lys Ala Met Tyr Ala Pro Pro lie Ser Gly Gin He 
30 390 395 40Q 

AGA TGC TCA TCA AAC ATT ACA GGG ATG ATA TTA ACA AGG 1246 
Arg Cys Ser Ser Asn He Thr Gly Met He Leu Thr Arg 
405 410 4i| 

GAT GGT GGT AAC GAG AAC AAT AAT GAG AGC AGT ACT ACT 128 5 
Asp Gly Gly Asn Glu Asn Asn Asn Glu Ser Ser Thr Thr 

420 425 
GAG ACC TTC AGA CCG GGA GGA GGA GAT ATG AGG AAC AAT 132 4 
Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn 

430 435 440 

4 0 TGG AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT 1363 
Trp Arg Ser. Glu Leu Tyr Lys Tyr Lys Val Val Lys lie 

445 450 
GAG CCA TTA GGA GTA GCA CCC ACC GAC TCT AGA GGA TCC 1402 
Glu Pro Leu Gly Val Ala Pro Thr Asp Ser Arg Gly Ser 
45 455 4 6 o 465 

TCT AGA 1408 
Ser Arg 
469 

CLONE C11.5

GAG GTA CCT GTG TGG AAA GAA GCA ACC ACT ACT CTA 36
Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1 5 10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GGG GTG 75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Gly Val
15 20 25

CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC 114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
30 35

CCC AAC CCA CAA GAA ATA GAA TTG GTA AAT CTG ACA GAA 153
Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu
40 45 50

GAT TTT AAC ATG TGG AAA AAT AAA ATG GTA GAC CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Lys Met Val Asp Gln Met
55 60

TCC TCA GGA GCG GAC CCA GAA ATT GTA ATG CAC ACT TTT 1050
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
340 345 350

AAC TGT GGA GGG GAG TTT TTC TAC TCT AAT ACA ACA CAA 1089
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
355 360

CTG TTT AAT ACT ACT TGG AAT AAT ACT GAA GCG ACA AAT 1128
Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr Asn
365 370 375

AGC ACT GAA GGA AAT AGC ACA ATC ACA CTC CCA TGC AGA 1167
Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg
380 385

ATA AAA CAA ATT ATA AAT ATG TGG CAG GAA GTA GGA AAA 1206
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
390 395 400

GCA ACG TAT GCC CCT CCC ATC AGA GGA CGA ATT AGA TCC 1245
Ala Thr Tyr Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys
405 410 415

ATA TCA AAT ATT ACA GGA CTG CTA TTA ACA AGA GAT GGT 1284
Ile Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425

GGT AGG AAT GTC ACA AAC AAT ACC GAA ACC TTC AGA CCT 1323
Gly Arg Asn Val Thr Asn Asn Thr Glu Thr Phe Arg Pro
430 435 440

GGA GGA GGA GAC ATG AGG GAC AAT TGG AGA AGT GAA TTA 1362
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
445 450

TAT AAA TAT AAA GTA GTA AAA GTT GAA CCA TTA GGA ATA 1401
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile
455 460 465

GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAC AGA GAC 1440
Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
470 475 480

AAA AGA GCA GCA CTA GGA GCC TTG TTC CTT GGG TTC TTA 1479
Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu
485 490

GGA GCA TAA AAG CTT CTA GA 1499
Gly Ala Xaa Lys Leu Leu
495 499



CLONE C11.7

GAG GTA CCT GTA TGG AAA GAA GCA ACC ACT ACT CTA 36
Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1 5 10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG 75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
15 20 25

CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC 114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
30 35

CCC AAC CCA CAA GAA ATA GAA TTG GTA AAT GTG ACA GAA 153
Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu
40 45 50

GAT TTT AAC ATG TGG AAA AAT AAA ATG GTA GAC CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Lys Met Val Asp Gln Met
55 60

CAT GAG GAT ATA ATC AGT TTA TGG GAT GAA ACC CTA AAG 231
His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser Leu Lys
65 70 75

CCA TGT GTA AAG TTA ACC CCA CTT TGT GTT ACT CTA AAC 270
Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
80 85 90

TGC AGT GAT GTG AAC AAT TCC ACA AAT CCT AAT GAT ACT 309
Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr
95 100

AAT ACT AAT TCC ACT AAT ACT ACT TCC TCT ACT CCT ACG 348
Asn Thr Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr
105 110 115

GCC ACT ACT AGT AGC GAG GAA AAG ATG GAG AAG GGA GAA 387
Ala Thr Thr Ser Ser Glu Glu Lys Met Glu Lys Gly Glu
120 125

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA CAC ATG AAA 426
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr His Met Lys
130 135 140

GAT AAG GTA CAG AAA GAA TAT GCA CTT TTT TAT AAA CTT 465
Asp Lys Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
145 150 155

GAT ATA GTA CCA ATA GAT GAT AAT AAT ACC AGC TAT AGG 504
Asp Ile Val Pro Ile Asp Asp Asn Asn Thr Ser Tyr Arg
160 165

TTG ATA AGT TGT AAT ACC TCA GTC ATT ACA CAG GCC TGT 543
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
170 175 180

CCA ATG GTC ACC TTT GAG CCA ATT CCC ATA CAT TAT TGT 582
Pro Met Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys
185 190

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG 621
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys
195 200 205

AAG TTC AAT GGA ACA GGA CCA TGT TCA AAG GTC AGC ACA 660
Lys Phe Asn Gly Thr Gly Pro Cys Ser Lys Val Ser Thr
210 215 220

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT 699
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230

CAA CTG TTG TTA AAT GGC AGT CTT GCA GAA GAA GAA GTA 738
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
235 240 245

GTA ATT AGA TCT GTC AAT TTC ACA GAC AAT GCT AAA ATC 777
Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Ile
250 255

ATA ATA GTA CAG CTG AAA GAA CCT GTA GCA ATT AAT TGT 816
Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile Asn Cys
260 265 270

ACA AGA CCC AAC AAC AAT ACA AGA AAA GGT ATA CAT CTA 855
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
275 280 285

GGA CCA GGG AGC ACA TTT TAT ACA ACA GGA GAA ATA ATA 894
Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile
290 295

GGA GAC ATA AGA AAA GCA TAT TGC AAG ATT AGT AAA GAA 933
Gly Asp Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu
300 305 310

AAA TGG AAT AAC ACT TTA AGA CAG GTA GTT AAA AAA TTA 972
Lys Trp Asn Asn Thr Leu Arg Gln Val Val Lys Lys Leu
315 320

AGA GAA CAA TTT GGG AAT AAA ACA ATA ATT TTT AAT CGA 1011
Arg Glu Gln Phe Gly Asn Lys Thr Ile Ile Phe Asn Arg
325 330 335

TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT 1050
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
340 345 350

AAC TGT GGA GGG GAG TTT TTC TAC TGT AAT ACA ACA CAA 1089
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
355 360

CTG TTT AAT AGT ACT TGG AAT AAT ACT GAA GGG ACA AAT 1128
Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr Asn
365 370 375

AGC ACT GAA GGA AAT AGC ACA ATC ACA CTC CCA TGC AGA 1167
Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg
380 385

ATA AAA CAA ATT ATA AAT ATG TCG CAG GAA GTA GGA AAA 1206
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
390 395 400

GCA ACG TAT GCC CCT CCC ATC AGA GGA CGA ATT AGA TGC 1245
Ala Thr Tyr Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys
405 410 415

ATA TCA AAT ATT ACA CGA CTG CTA TTA ACA AGA GAT GGT 1284
Ile Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425

GGT AGG AAT GTC ACA AAC AAT ACC GAN NCC TTC AGA CCT 1323
Gly Arg Asn Val Thr Asn Asn Thr Xaa Xaa Phe Arg Pro
430 435 440

GGA GGA GGA GAC ATG AGG GAC AAT TGG AGA ACT GAA TTA 1362
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
445 450

TAT AAA TAT AAA GTA GTA AAA GTT GAA CCA TTA CGA ATA 1401
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile
455 460 465

GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAC AGA GAC 1440
Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
470 475 480

AAA AGA GCA GCA CTA GGA CCT TTC TTC CTT GGG TTC TTA 1479
Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu
485 490

GGA GCA TAA AAG CTT CTA GA 1499
Gly Ala Xaa Lys Leu Leu
495 499

CLONE C10.5

G GTA CCT GTG TGG AAA GAA GCA AAC ACA ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AAA CCA TAT GAT AGA GAA GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Arg Glu Val His
15 20 25

AAT GTT TGG GCA ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AAC CCA CAA GAA ATA GTA TTG GGA AAT GTG ACA GAA AAT 154
Asn Pro Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG CAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AAT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAG TTA ACT CCA CTC TGT GTT ACT TTA AAG TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
80 85 90

AAG GAT CTG GAG AGG AAT ACT ACC TAT AAT AGC ACT ATT 310
Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile
95 100

ACC AAT AAT ACT ACT TTG GAG GGA CTA AGA GAA CAA ATG 349
Thr Asn Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met
105 110 115

ACA AAC TGC TCT TTC AAC ATC ACC ACA ACT ATA AGA GAT 388
Thr Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp
120 125

AAG GTG CAG AAA GAA TAT GCA CTT TTG TAT AAA CTT GAT 427
Lys Val Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp
130 135 140

... GAA GAA GAT GAC AAT ACT AGC TAT AGA 466
... Glu Glu Asp Asp Asn Thr Ser Tyr Arg
150 155

... AAC ACC TCA CTC ATT ACA CAG GCT TGT 505
... Asn Thr Ser Val Ile Thr Gln Ala Cys
160 165

... TTT GAG CCA ATT CCC ATA CAT TAT TGT 544
... Phe Glu Pro Ile Pro Ile His Tyr Cys
175 180

... TTT GCG ATT CTA AAG TGT AAT GAT AAG 583
... Phe Ala Ile Leu Lys Cys Asn Asp Lys
190

... ACA GGA CCA TGT AAA AAT GTC AGC ACA 622
... Thr Gly Pro Cys Lys Asn Val Ser Thr
200 205

... CAT GGA ATT AGG CCA CTA GTA TCA ACT 661
... His Gly Ile Arg Pro Val Val Ser Thr
215 220

... AAT GGC AGT CTA GCA GAA GAA GAG GTA 700
... Asn Gly Ser Leu Ala Glu Glu Glu Val
225 230

... GCC AAT TTC ACA GAC AAT GCT AAA ACC 739
... Ala Asn Phe Thr Asp Asn Ala Lys Thr
240 245

... CTA AAT GAA ACT GTA AAA ATT AAT TGT 778
... Leu Asn Glu Thr Val Lys Ile Asn Cys
255

... AAC AAT ACA AGA AAA AGT ATA AAT ATA 817
... Asn Asn Thr Arg Lys Ser Ile Asn Ile
265 270

... GTA CTC TAT GCA ACA GGA GAA ATA ATA 856
... Val Leu Tyr Ala Thr Gly Glu Ile Ile
280 285

... CAA GCA CAT TGT AAC ATT AGT AGA GCA 895
... Gln Ala His Cys Asn Ile Ser Arg Ala
290 295

... ACT TTA GAA AAG GTA GTT GAC AAA TTA 934
Gln Trp Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu
300 305 310

AGA AAA CAA TTT GGG GAT AAT ACA ACA ATA GCT TTT AAT 973
Arg Lys Gln Phe Gly Asp Asn Thr Thr Ile Ala Phe Asn
315 320

CGA ... GGA GGG GAC CCA GAA ATT GTA ATG CAC ACT 1012
Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr
325 330 335

TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA 1051
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
340 345 350

CAA CTG TTT AAT AGT ACT TGG AAT AAT ACT TGG AAG GAT 1090
Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Trp Lys Asp
355 360

CCT AAC AGG AGT GAC AAT ATC ACA CTC CCA TGC AGA ATA 1129
Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg Ile
365 370 375

AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA 1168
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
380 385

ATG TAC GCC CCT CCC ATC AGA GGG GAA ATT AGA TGT TCA 1207
Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser
390 395 400

TCA AAT ATC ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT 1246
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
405 410 415

CCA TTA GGA ATA
Leu Ile Pro Thr Arg Ala
455

AAA AGA GCA GTA GGA
Val Gln Arg Glu Lys Arg Ala Val Gly
470 475

425

AGG GAC AAT
Arg Asp Asn
440

GTA AAA ATT
Val Lys Ile

AAG AGA AGA
Lys Arg Arg
465

480

Phe Leu Gly
483

CLONE C10.7

G GTA CCT GTG TGG AAA GAA GCA AAC ACA ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe

TGT GCA TCA GAT GCT AAA GCA TAT GAT AGA GAA GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Arg Glu Val His
15 20 25

AAT GTT TGG GCA ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro

AAC CCA CAA GAA ATA GTA TTG GGA AAT GTG ACA GAA AAT 154
Asn Pro Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG CAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AAT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAG TTA ACT CCA CTC TGT GTT ACT TTA AAG TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
80 85 90

AAG GAT CTG GAG AGG AAT ACT ACC TAT AAT AGC ACT ATT 310
Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile
95 100

ACC AAT AAT AGT AGT TTG GAG GGA CTA AGA GAA CAA ATG 349
Thr Asn Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met
105 110 115

ACA AAC TGC TCT TTC AAC ATC ACC ACA AGT ATA AGA GAT 388
Thr Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp
120 125

AAG GTG CAG AAA GAA TAT GCA CTT TTG TAT AAA CTT GAT 427
Lys Val Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp
130 135 140

CTA GTA CCA ATA GAA GAA GAT GAC AAT ACT AGC TAT AGA 466
Val Val Pro Ile Glu Glu Asp Asp Asn Thr Ser Tyr Arg
145 150 155

TTG ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCT TGT 505
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
160 165

CCA AAG ACA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT 544
Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
170 175 180

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG 583
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
185 190

AAG TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGC ACA 622
Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
195 200 205

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT 661
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
210 215 220

CAA CTG TTG TTA AAT GGC AGT CTA GCA GAA GAA GAG GTA 700
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
225 230

GTA ATC AGA TCT GCC AAT TTC ACA GAC AAT GCT AAA ACC 739
Val Ile Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr
235 240 245

ATA ATA GTA CAT CTA AAT GAA ACT GTA AAA ATT AAT TGT 778
Ile Ile Val His Leu Asn Glu Thr Val Lys Ile Asn Cys
250 255

ACA AGA CTT GGC AAC AAT ACA AGA AAA AGT ATA AAT ATA 817
Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile Asn Ile
260 265 270

GGA CCA GGG AGA GTA CTC TAT GCA ACA GGA GAA ATA ATA 856
Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
275 280 285

GGA GAC ATA AGA CAA GCA CAT TCT AAC ATT AGT AGA GCA 895
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala
290 295

CAA TGG AAT AAG ACT TTA GAA AAG GTA GTT GAC AAG TTA 934
Gln Trp Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu
300 305 310

AGA AAA CAA TTT GGG GAT AAT ACA ACA ATA GCT TTT AAT 973
Arg Lys Gln Phe Gly Asp Asn Thr Thr Ile Ala Phe Asn
315 320

CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC ACT 1012
Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr
325 330 335

TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA 1051
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
340 345 350

CAA CTG TTT AAT AGT ACT TGG AAT AAT ACT TGG AAG GAT 1090
Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Trp Lys Asp
355 360

CCT AAC AGG ACT GAC AAT ATC ACA CTC CCA TGC AGA ATA 1129
Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg Ile
365 370 375

AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA 1168
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
380 385

ATG TAC GCC CCT CCC ATC AGA GGG GAA ATT AGA TGT TCA 1207
Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser
390 395 400

TCA AAT ATC ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT 1246
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
405 410 415

AAT GAC GAT GGT AAT GAC ACG ACC ACA AAC AGG ACC GAG 1285
Asn Asp Asp Gly Asn Asp Thr Thr Thr Asn Arg Thr Glu
420 425

ATC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1324
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
430 435 440

AGA ACT CAA TTA TAT AGA TAT AAA GTA GTA AAA ATT CAA 1363
Arg Ser Glu Leu Tyr Arg Tyr Lys Val Val Lys Ile Glu
445 450

CCA TTA GGA ATA CCA CCC ACC AGG GCA AAG AGA AGA GTG 1402
Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg Arg Val
455 460 465

CTG CAG AGA GAA AAA AGA GCA GTA GGA CTA GGA GCT TTG 1441
Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
470 475 480

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1475
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
485 490 491

CLONE C17.1 

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAT TCA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu
15 20 25

GCA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA 114
Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAC CCC AAC CCA CAA GAA GTA GAA TTG GAA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATC GTA GAA CAG 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GGG GAT ATA ATT ACT TTA TGG GAT CAA AGC CTA 231
Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACG TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT GAC CCA AAT GTT ACT AAT ACC GAG AGA ACG 309
Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr
95 100

ATA GAG GGG GGA GAA ATA AAA AAT TGC TCT TTC AAT ATC 348
Ile Glu Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile
105 110 115

ACC ACA AAC ATA AGA GAT AGG TTT CAG AAA GAA TAT GCA 387
Thr Thr Asn Ile Arg Asp Arg Phe Gln Lys Glu Tyr Ala
120 125

CTT TTT TAT AAA CTT GAT GTA ATA CCA TTA GGT AAT GAT 426
Leu Phe Tyr Lys Leu Asp Val Ile Pro Leu Gly Asn Asp
130 135 140

AAT ACT AGC TAT AGG TTG ATA ACT TGT AAC ACC TCA CTC 465
Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
145 150 155

ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT 504
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
160 165

CCC ATA CAT TAT TGT GCC CCG CCT GGT TTT GCG ATT CTA 543
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
170 175 180

AAG TGT AAA GAT AAG AAG TTC AAT GGA ACA GGA CCA TGT 582
Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys
185 190

ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT CCA ATT AAG 621
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
195 200 205

CCA GTA GTA TCA ACT CAA CTG TTG TTA AAT GGC ACT CTA 660
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
210 215 220

GCA GAA GAA GAC ATA GTA ATT AGA TCC GCC AAT CTC ACA 699
Ala Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Leu Thr
225 230

GAC AAT GCT AAA AAC ATA ATA GTA CAG CTG AAT GAA TCT 738
Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Glu Ser
235 240 245






CTA ACA ATC AAT TGT ACA AGA CCC AAC AAC AAT ACA ATG 777
Val Thr Met Asn Cys Thr Arg Pro Asn Asn Asn Thr Met
250 255

AAA AGT ATA CAT ATA GGA CCA GGC AGA GCA TTT TAT GCA 816
Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
260 265 270

ACA GGA AAC ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT 855
Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
275 280 285

AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG 894
Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys
290 295

ATA GCT ATA AAA TTA AGA GAA CAA TTT AAT AAG ACA ATA 933
Ile Ala Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile
300 305 310

GTC TTT AAT CAA TCC TCA GGA GGG GAC CCA GAA ATT GCA 972
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ala
315 320

ACG CTC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT 1011
Thr Leu Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
325 330 335

AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT ACT ACT 1050
Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr
340 345 350

GGG TCA AAT AAC ACT AAA GGA AAT GAC ACA ATC ACA CTC 1089
Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr Ile Thr Leu
355 360

CCA TGC AGA ATA AGA CAA ATT ATA AAC ATG TGG CAG AAA 1128
Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Lys
365 370 375

ATA GGA AAA GCA ATG TAT GCC CCT CCC ATC AAA GGG CAA 1167
Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln
380 385

ATT AGA TGT TCA TCA AAT ATT ACA GGG CTA ATA TTA ACA 1206
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr
390 395 400

AGA GAT GGT GGT AAC AAC AAC ATG AGC AAG ACC ACC GAG 1245
Arg Asp Gly Gly Asn Asn Asn Met Ser Lys Thr Thr Glu
405 410 415

ACC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGC 1284
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
420 425

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT CAA 1323
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
430 435 440

CCA TTA GGA GTA GCA CCC ACC AGG GCA AAG AGA AGA GTG 1362
Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val
445 450

GTG CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT GTG 1401
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
455 460 465

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1435
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
470 475 478

CLONE C17.3 

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT CAT TCA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu
15 20 25

GCA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA 114
Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAC CCC AAC CCA CAA GAA GTA GAA TTG GAA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA CAA CAG 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GGG GAT ATA ATT ACT TTA TGG GAT CAA AGC CTA 231
Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACG TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT CAC CCA AAT GTT ACT AAT ACC GAG AGA ACG 309
Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr
95 100

ATA GAG GGG GGA GAA ATA AAA AAT TGC TCT TTC AAT ATC 348
Ile Glu Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile
105 110 115

ACC ACA AAC ATA AGA GAT AGG TTT CAG AAA GAA TAT CCA 387
Thr Thr Asn Ile Arg Asp Arg Phe Gln Lys Glu Tyr Ala
120 125

CTT TTT TAT AAA CTT GAT GTA ATA CCA TTA GGT AAT GAT 426
Leu Phe Tyr Lys Leu Asp Val Ile Pro Leu Gly Asn Asp
130 135 140

AAT ACT AGC TAT AGG TTG ATA ACT TGT AAC ACC TCA CTC 465
Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
145 150 155

ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT 504
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
160 165

CCC ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT CTA 543
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
170 175 180

AAG TGT AAA GAT AAG AAG TTC AAT GGA ACA GGA CCA TGT 582
Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys
185 190

ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AAG 621
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
195 200 205

CCA GTA GTA TCA ACT CAA CTG TTG TTA AAT GGC AGT CTA 660
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
210 215 220

GCA GAA GAA GAC ATA GTA ATT AGA TCC GCC AAT CTC ACA 699
Ala Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Leu Thr
225 230

GAC AAT GCT AAA AAC ATA ATA GTA CAG CTG AAT GAA TCT 738
Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Glu Ser
235 240 245

GTA ACA ATG AAT TGT ACA AGA CCC AAC AAC AAT ACA ATG 777
Val Thr Met Asn Cys Thr Arg Pro Asn Asn Asn Thr Met
250 255

AAA AGT ATA CAT ATA GGA CCA GGC AGA GCA TTT TAT GCA 816
Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
260 265 270

ACA GGA AAC ATA ATA GGA GAT ATA ACA CAA GCA CAT TGT 855
Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
275 280 285

AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG 894
Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys
290 295

ATA GCT ATA AAA TTA AGA GAA CAA TTT AAT AAG ACA ATA 933
Ile Ala Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile
300 305 310

GTC TTT AAT CAA TCC TCA GGA GCG GAC CCA GAA ATT GCA 972
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ala
315 320

ACG CTC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT 1011
Thr Leu Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
325 330 335

AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AGT ACT 1050
Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr
340 345 350

GGG TCA AAT AAC ACT AAA GGA AAT GAC ACA ATC ACA CTC 1089
Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr Ile Thr Leu
355 360

CCA TGC AGA ATA AGA CAA ATT ATA AAC ATG TGG CAG AAA 1128
Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Lys
365 370 375

ATA GGA AAA GCA ATG TAT GCC CCT CCC ATC AAA GGG CAA 1167
Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln
380 385

ATT AGA TGT TCA TCA AAT ATT ACA GGG CTA ATA TTA ACA 1206
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr
390 395 400

AGA GAT GGT CGT AAC AAC AAC ATG AGC AAG ACC ACC GAG 1245
Arg Asp Gly Gly Asn Asn Asn Met Ser Lys Thr Thr Glu
405 410 415

ACC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1284
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
420 425

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA 1323
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
430 435 440

CCA TTA GGA GTA GCA CCC ACC AGG GCA AAG AGA AGA GTG 1362
Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val
445 450

GTG CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT GTG 1401
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
455 460 465

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1435
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
470 475 478


In addition to the listing in Table 1, Figure 3 shows
the alignment of the amino acid sequences of the clones
of each of the seven isolates. Corresponding residues
from various clones are in boxes. In the figure, the
amino acid sequences are aligned against MN-rgp120
(SEQ. ID. NO. 29).

In one embodiment, a gp120 polypeptide of this 
invention has the same amino acid sequence as the 
sequence of one of the breakthrough isolates. In 
another embodiment, the amino acid sequence is 
truncated, as described in detail hereinafter. In 
another embodiment, a gp120 polypeptide sequence of 
this invention contains a substitution, insertion, or 
deletion (alteration) of one or more amino acids in the 
sequence of a breakthrough isolate. Usually, with the 
exception of amino acids that are not present in a 
truncated amino acid sequence and eliminate an epitope, 
a gp120 polypeptide of this invention will include 
alterations in the amino acid sequence of a 
breakthrough isolate that do not alter the 
polypeptide's ability to induce the same neutralizing 
antibodies as the amino acid sequence of the isolate. 

In general, substitutions in the amino acid 
sequence of a gp120 polypeptide of this invention are 
conservative substitutions, particularly for amino acid 
residues in the V2, V3, and C4 domains of gp120, which 
domains contain neutralizing epitopes. However, 
non-conservative substitutions, particularly in domains 
that do not contain neutralizing epitopes, are 
contemplated. 

Conservative substitutions replace an amino acid 
with an amino acid of similar size and character. For 
example, a hydrophobic residue or hydrophilic residue 
is replaced with another hydrophobic residue or 
hydrophilic residue, respectively. Amino acids can be 
divided into the following groups: positively charged 
residues (K, R, and H); negatively charged residues 
(D and E); amides (N and Q); aromatics (F, Y, and W); 
hydrophobics (P, G, A, V, L, I, and M); and uncharged 
residues (S and T). Usually, residues within a group 
are replaced with another member of the group. 
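
The grouping above can be expressed as a simple lookup. The following Python sketch is illustrative only: the function names are hypothetical, and treating residues that the text does not assign to any group (for example, cysteine) as never conservative is an assumption rather than something the specification states.

```python
# Residue groups exactly as listed in the text (one-letter codes).
GROUPS = {
    "positively charged": set("KRH"),
    "negatively charged": set("DE"),
    "amide": set("NQ"),
    "aromatic": set("FYW"),
    "hydrophobic": set("PGAVLIM"),
    "uncharged": set("ST"),
}


def group_of(residue):
    """Return the group name for a residue, or None if the text lists no group."""
    for name, members in GROUPS.items():
        if residue.upper() in members:
            return name
    return None


def is_conservative(original, replacement):
    """True when both residues fall in the same listed group (assumption:
    residues without a listed group are never treated as conservative)."""
    group = group_of(original)
    return group is not None and group == group_of(replacement)


def classify_substitutions(reference, variant):
    """Compare two equal-length sequences and report each differing position
    as (1-based position, reference residue, variant residue, conservative?)."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (i + 1, r, v, is_conservative(r, v))
        for i, (r, v) in enumerate(zip(reference, variant))
        if r != v
    ]
```

For instance, `classify_substitutions("KDS", "RDT")` reports K to R at position 1 and S to T at position 3, both within a single group.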

In one embodiment, critical amino acid residues in 
the V2, V3, and C4 domains of gp120 are identical to 
the corresponding residues in a breakthrough isolate 
sequence. Critical amino acid residues in the V2, V3, 
and C4 domains of gp120 are described in the 
experimental section. In another embodiment, all amino 
acid residues in the V2, V3, and C4 domains of gp120 
are identical to corresponding residues in a 
breakthrough isolate sequence. 

Oligonucleotide Encoding gp120 from Breakthrough 
Isolates 

The present invention also provides novel 
oligonucleotides encoding gp120 from the breakthrough 
isolates which can be used to express gp120. An 
oligonucleotide of this invention encodes a polypeptide 
of this invention. The oligonucleotide can be DNA or 
RNA, usually DNA. Although numerous nucleotide 
sequences can encode the same amino acid sequence due 
to the degeneracy of the genetic code, conveniently, 
the oligonucleotides of this invention include a 
nucleotide sequence of a breakthrough isolate as 
illustrated in Table 1 (Sequence ID Nos. 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, and 28). Usually, 
an oligonucleotide of this invention is less than about 
5 kilobases (kb), preferably less than about 3 kb. 
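
The degeneracy of the genetic code mentioned above can be illustrated with a short Python sketch. It assumes the standard genetic code, and the helper names `translate` and `synonymous_codons` are hypothetical. The example input is the first seven codons of the Table 1 row that ends at nucleotide 972.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon ('*' marks a stop codon).
    Whitespace is ignored; trailing bases that do not fill a codon are dropped."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def synonymous_codons(amino_acid):
    """All codons that encode the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


# Seven codons copied from the Table 1 row ending at nucleotide 972.
peptide = translate("GTC TTT AAT CAA TCC TCA GGA")
glycine_codons = synonymous_codons("G")
```

Here `peptide` is `"VFNQSSG"` (Val Phe Asn Gln Ser Ser Gly), and glycine alone can be written with four different codons, which is why many nucleotide sequences encode the same polypeptide.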

To express the encoded amino acid sequence, the 
oligonucleotide can be inserted into a transcription 
unit. The transcription unit can be inserted into a 
plasmid for production of cell lines, inserted into a 
virus (e.g., vaccinia), or used directly as a DNA 
vaccine. Suitable transcription units for production 
of vaccine proteins are well known. A preferred 
expression vector, designated psvI6B5, is illustrated 
in Sequence ID No. 32. The vector includes an HSV-1 
gD1 signal sequence joined to a linker sequence. The 
gp120 nucleotide sequence to be expressed starts with 
the Kpn I site of the gene. Since all gp120 or gp160 
sequences contain this site, any gp120 nucleotide 
sequence can be analogously inserted into the vector 
and expressed. The vector ends with a polyA tail from 
SV40. 

In addition to being useful to express a 
polypeptide sequence of this invention, the 
oligonucleotides of this invention can also be used in 
diagnostics to detect HIV isolates. For example, the 
oligonucleotide or a portion thereof encoding a 
neutralizing epitope can be used in branched chain DNA 
diagnostics or as a probe in in situ hybridization 
studies. 

Vaccine Preparation 

A gp120 polypeptide of this invention from a 
selected breakthrough isolate(s) in a suitable carrier 
is used to make a subunit vaccine. The polypeptide can 
be used alone, but is generally administered in a 
multivalent subunit vaccine that includes gp120 MN. In 
addition to one or more gp120 polypeptides of this 
invention, the vaccine generally includes the MN 
polypeptide (hereinafter, MN-rgp120). The vaccine 
usually includes about 3 to about 5 different gp120 
polypeptides, but 30 or more different gp120 
polypeptides can be used. 

Preparation of gp120 polypeptides for use in a 
vaccine is well known and is described hereinafter. 
With the exception of the use of the selected HIV 
isolate, the gp120 subunit vaccine prepared in the 
method does not differ from gp120 subunit vaccines of 
the prior art. 

As with prior art gp120 subunit vaccines, gp120 at 
the desired degree of purity and at a sufficient 
concentration to induce antibody formation is mixed 
with a physiologically acceptable carrier. A 
physiologically acceptable carrier is nontoxic to a 
recipient at the dosage and concentration employed in 
the vaccine. Generally, the vaccine is formulated for 
injection, usually intramuscular or subcutaneous 
injection. Suitable carriers for injection include 
sterile water, but preferably are physiologic salt 
solutions, such as normal saline or buffered salt 
solutions such as phosphate-buffered saline or Ringer's 
lactate. The vaccine generally contains an adjuvant. 
Useful adjuvants include QS21 (Quillaja saponaria, 
commercially available from Cambridge Biotech, 
Worcester, MA), which stimulates cytotoxic T-cells, and 
alum (aluminum hydroxide adjuvant). Formulations with 
different adjuvants which enhance cellular or local 
immunity can also be used. In particular, 
immunopotentiators such as cytokines can be included in 
the vaccine. Examples of suitable immunopotentiating 
cytokines include interleukins, such as interleukin-2 
(IL-2) and interleukin-12 (IL-12), and tumor necrosis 
factor-alpha (TNF-α). 

Additional excipients that can be present in the 
vaccine include low molecular weight polypeptides (less 
than about 10 residues), proteins, amino acids, 
carbohydrates including glucose or dextrans, chelating 
agents such as EDTA, and other excipients that 
stabilize the protein or inhibit growth of 
microorganisms. 

The vaccine can also contain other HIV proteins. 
In particular, gp41 or the extracellular portion of 
gp41 or HIV-1 core proteins such as P24, P17, and P55 
can be present in the vaccine. Although the amino acid 
sequence of gp41 is more conserved than that of gp120, 
gp41 contains neutralizing epitopes. Preferably, any 
gp41 present in the vaccine is from an HIV isolate 
present in the vaccine. gp160 from an isolate used in 
the vaccine can replace gp120 in the vaccine or be used 
together with gp120 from the isolate. Alternatively, 
gp160 from a different isolate than those in the 
vaccine can additionally be present in the vaccine. 

Vaccines according to the invention can also 
contain one or more soluble gp120 polypeptide 
sequences, or fragments thereof, in combination with an 
engineered virus specifically designed to express 
proteins that induce a cytotoxic T-cell response. 
Suitable engineered viruses are derived from, for 
example, Canary Pox virus, vaccinia viruses, attenuated 
human herpes viruses (such as, e.g., herpes simplex 
viruses), and Varicella Zoster. Exemplary engineered 
viruses are modified to express any HIV protein capable 
of inducing a cytotoxic T-cell response, such as those 
described above. Typically, immunization with the 
gp120/engineered virus vaccine is followed by 
administration of one or more doses of the gp120 
polypeptide sequence(s) to boost the immune response. 
If desired, viruses can be engineered to express one or 
more gp120 polypeptide sequences of the invention, or 
fragments thereof, and used in vaccines with or without 
soluble gp120 polypeptide sequences. 

Vaccine formulations generally include a total of 
about 300 to 600 µg of gp120, conveniently in about 
1.0 ml of carrier. Preferred formulations include use 
of twice the weight of a gp120 polypeptide in twice 
as 600 µg alum. However, formulations having smaller 
amounts (e.g., 50 µg per dose) are also used, generally 
with alum or other adjuvants. The amount of gp120 for 
any isolate present in the vaccine will vary depending 
on the immunogenicity of the gp120. For example, gp120 
from some strains of HIV may be less immunogenic than 
gp120 from the MN strain (Sequence ID No. 29). If two 
strains having different immunogenicity are used in 
combination, empirical titration of the amount of each 
virus would be performed to determine the percent of 
the gp120 of each strain in the vaccine. For isolates 
having similar immunogenicity, approximately equal 
amounts of each isolate's gp120 would be present in the 
vaccine. For example, in a preferred embodiment, the 
vaccine includes gp120 from the MN strain and a strain 
of this invention at concentrations of about 300 µg per 
strain in about 1.0 ml of carrier. When the vaccine 
includes gp120 from about 30 isolates, about 10 to 
about 50 µg can be used. Methods of determining the 
relative amount of an immunogenic protein in 
multivalent vaccines are well known and have been used, 
for example, to determine relative proportions of 
various isolates in multivalent polio vaccines. 

The vaccines of this invention are administered in 
the same manner as prior art HIV gp120 subunit 
vaccines. In particular, the vaccines are generally 
administered at 0, 1, and at 6, 3 or 12 months, 
depending on the protocol. A preferred protocol 
includes administration at 0, 1, 6, and 12 months. 
Following the immunization procedure, annual or 
bi-annual boosts can be administered. However, during 
the immunization process and thereafter, neutralizing 
antibody levels can be assayed and the protocol 
adjusted accordingly. 

The vaccine is administered to uninfected 
individuals. In addition, the vaccine can be 
administered to seropositive individuals to augment 
immune response to the virus, as with prior art HIV 
vaccines. It is also contemplated that DNA encoding 
the strains of gp120 for the vaccine can be 
administered in a suitable vehicle for expression in 
the host. In this way, gp120 can be produced in the 
infected host, eliminating the need for repeated 
immunizations. Preparation of gp120 expression 
vehicles is described hereinafter. 

Although the gp120 isolates described herein can 
be used as a vaccine as described above, the amino acid 
sequences can also be used alone or in combinations in 
the same type of formulation for use as an immunogen, 
to induce antibodies that recognize the isolate(s) 
present in the immunogen. Immunogens are formulated in 
the same manner as vaccines and can include the same 
excipients, etc. Antibodies induced by the immunogens 
can be used in a diagnostic to detect the HIV strain in 
the immunogen or to affinity purify the strain. 

gp120 Polypeptide Sequences and Chemokine Receptors 
While CD4 is the primary cellular receptor for 
HIV-1, it is not sufficient for entry of HIV-1 into 
cells. Co-receptors required in conjunction with CD4 
have been identified. These co-receptors are members 
of the chemokine receptor family of seven-transmembrane 
G-protein coupled receptors. The chemokine superfamily 
is subdivided into two groups based on the amino 
terminal cysteine spacing. The CXC chemokines are 
primarily involved in neutrophil-mediated inflammation, 
and the CC chemokines tend to be involved in chronic 
inflammation. At least five CC chemokine receptors, 
designated CC-CKR1-5 (also known in the art as CCR1-5), 
and at least four CXC chemokine receptors, designated 
CXC-CKR1-4 (also known as CXCR-1-4), have been 
identified. 

CXC-CKR-4 (CXCR-4), which has also been called the 
alpha-chemokine receptor fusin, serves as an entry 
cofactor for T-cell-tropic HIV-1 strains. CC-CKR-5 
(CC-R5), which has been called beta-chemokine receptor, 
together with its related family members, such as 
CC-CKR-2b and CC-CKR-3, serve as entry cofactors for 
macrophage-tropic HIV-1 strains. T-cell-tropic strains 
can infect primary T-cells and T-cell lines, but not 
macrophages, whereas macrophage-tropic strains can 
infect macrophages and primary T-cells, but not T-cell 
lines. T-cell- and macrophage-tropic strains are 
discussed more fully in Deng et al., Nature 
381:661-666 (1996), which is hereby incorporated by 
reference in its entirety. Examples of T-cell-tropic 
strains include laboratory isolates, such as IIIB and 
MN. Macrophage-tropic strains include primary isolates, 
including but not limited to A244, GNE6, GNE8, and 
breakthrough viruses from vaccinees immunized with 
gp120-based vaccines. Dual-tropic strains can use both 
types of co-receptors, entering cells via CXC-CKR-4 or 
via one or more CC-CKR family members, preferably 
CC-CKR-5, CC-CKR-2b, or CC-CKR-3. While the present 
invention is not intended to be bound or limited by any 
one theory, the entry of T-cell tropic and 
macrophage-tropic HIV-1 strains is believed to provide 
a unifying explanation of the differences in cell 
tropism between viral strains, the resistance to HIV-1 
infection by many CD4-transfected nonprimate cells, and 
the HIV-1-infection resistance of a portion of the 
human population. 

Accordingly, one embodiment is a vaccine 
containing (1) a first gp120 polypeptide sequence, or 
fragment thereof, from a macrophage-tropic HIV-1 strain 
and/or a second gp120 polypeptide sequence, or fragment 
thereof, from a T-cell tropic strain, in combination 
with (2) a breakthrough isolate HIV gp120 polypeptide 
sequence, or fragment thereof, from a vaccinee 
vaccinated with the first and/or second HIV gp120 
polypeptide sequence. Preferably, the vaccine includes 
at least two gp120 polypeptide sequences that bind to 
different chemokine receptors. In one embodiment, the 
vaccine includes first and second gp120 polypeptide 
sequences that bind to different chemokine receptors. 
In addition, the breakthrough isolate gp120 polypeptide 
sequence can bind to a different chemokine receptor 
than the chemokine receptor(s) bound by either or both 
of the first and second gp120 polypeptide sequence(s). 

A preferred T-cell tropic strain is a laboratory 
isolate, most preferably MN. Preferred 
macrophage-tropic viruses for use in the invention are 
GNE6 and GNE8, which are representative of the 
breakthrough viruses disclosed herein and differ from 
MN in that their gp120s induce the formation of 
antibodies that recognize the gp120 sequences (e.g., 
the V3 domain) involved in binding to CC chemokine 
receptors, such as CC-CKR-5. 

In one embodiment, HIV infection is prevented by 
administering one or more chemokine receptor-binding 
gp120 polypeptide sequences, or fragment(s) thereof 
containing appropriate chemokine receptor-binding 
domains, in a vaccine, such as those described above. 
Preferably, the vaccine also includes one or more 
CD4-binding gp120 polypeptide sequences or appropriate 
fragments thereof. Such vaccines induce anti-HIV 
antibodies that inhibit viral gp120-chemokine receptor 
or -CD4 binding. In addition, such gp120 polypeptides 
can directly inhibit HIV infection by binding to one or 
more co-receptors for HIV infection, such as CD4 or a 
chemokine receptor, thus providing a prophylactic or 
therapeutic effect in treating HIV infection. 
Preferably, gp120 polypeptide sequences useful in this 
regard contain the T-cell binding (TCB) domain. 

Various uses of chemokine receptor-binding gp120 
polypeptides are discussed below with regard to the CC 
chemokine receptor family. However, those skilled in 
the art recognize that this discussion applies equally 
to CXC chemokine receptors that act as cofactors in HIV 
infection. 

The gp120 polypeptides can be used as a 
composition containing one or more gp120 polypeptides, 
as described for use as a vaccine or immunogen. The 
composition can be administered, prophylactically or 
therapeutically, to a patient at risk of infection or 
in need of such treatment using the dosages and routes 
and means of administration described herein. However, 
chronic administration may be preferred and dosages can 
be adjusted accordingly. It is noted that in vivo 
administration can also induce antibodies that bind 
viral gp120, further inhibiting virus binding to 
CC-CKR. 

The gp120 polypeptides can also be used in 
screening assays to identify antagonists of CC-CKR. 
For example, candidate antagonists can be screened for 
inhibition of binding of gp120 to a CC-CKR 
receptor that is isolated and attached to a surface 
(e.g., plastic dish) or recombinantly or naturally 
expressed on the surface of a cell. Antagonists can 
either bind gp120 or bind receptor. Preferred 
candidate antagonists include gp120 compounds, small 
gp120 peptides (5 to 20 amino acids in length, 
preferably 7 to 10 amino acids in length) or 
peptidomimetics of gp120 that bind receptor, monoclonal 
antibodies that bind gp120, and small organic molecules 
that bind either gp120 or receptor. 

The antibodies induced by the gp120 polypeptides 
can also be used to induce anti-idiotype antibodies 
that bind CC chemokines. These anti-idiotype 
antibodies can be screened for binding to an anti-gp120 
polypeptide antibody and inhibiting gp120 from binding 
CC-CKR receptor. Such anti-idiotype antibodies mimic 
gp120 by binding to CC-CKR receptor. Such antibodies, 
preferably human antibodies, can be obtained in a 
number of ways, such as human antibodies from 
combinatorial libraries (e.g., Burton et al. Adv. 
Immunol. (1994) 57:191-280). It is now possible to 
produce transgenic animals (e.g., mice) that are 
capable, upon immunization, of producing a full 
repertoire of human antibodies in the absence of 
endogenous immunoglobulin production. For example, 
homozygous deletion of the antibody heavy-chain joining 
region (JH) gene in chimeric and germ-line mutant mice 
results in complete inhibition of endogenous antibody 
production. Transfer of the human germ-line 
immunoglobulin gene array in such germ-line mutant mice 
results in the production of human antibodies upon 
antigen challenge as described in Jakobovits et al., 
Proc. Natl. Acad. Sci. USA 90: 2551 (1993); Jakobovits 
et al., Nature 362:255-258 (1993); Bruggermann et al., 
Year in Immuno. 7: 33 (1993). 

Alternatively, phage display technology as 
described by McCafferty et al., Nature 348:552-553 
(1990) can be used to produce human antibodies and 
antibody fragments in vitro from immunoglobulin 
variable (V) domain gene repertoires from unimmunized 
donors. According to this technique, antibody V domain 
genes are cloned in-frame into either a major or 
minor coat protein gene of a filamentous bacteriophage, 
such as M13 or fd, and displayed as functional antibody 
fragments on the surface of the phage particle. 
Because the filamentous particle contains a 
single-stranded DNA copy of the phage genome, 
selections based on the functional properties of the 
antibody also result in selection of the gene encoding 
the antibody exhibiting those properties. Phage display 
can be performed in a variety of formats as reviewed 
by, for example, Johnson et al., Current Opinion in 
Structural Biology 3:564-571 (1993). 

Several sources of V-gene segments can be used for 
phage display. Clackson et al., Nature, 352: 624-628 
(1991) isolated a diverse array of anti-oxazolone 
antibodies from a small random combinatorial library of 
V genes derived from the spleens of immunized mice. A 
repertoire of V genes from unimmunized human donors (or 
embryonic cells) can be constructed. It has been 
demonstrated that antibodies to a diverse array of 
antigens (including self-antigens) can be isolated 
essentially following the techniques described by Marks 
et al., J. Mol. Biol., 222: 581-597 (1991), or Griffith 
et al., EMBO J., 12: 725-734 (1993). 

In a natural immune response, antibody genes 
accumulate mutations at a high rate (somatic 
hypermutation). Some of the changes introduced confer 
higher affinity, and B cells displaying high-affinity 
surface immunoglobulin are preferentially replicated 
and differentiated during subsequent antigen challenge. 
This natural process can be mimicked by employing the 
technique known as "chain shuffling" (Marks et al., 
Bio/Technol. 10:779-783 [1992]). In this method, the 
affinity of "primary" human antibodies obtained by 
phage display can be improved by sequentially replacing 
the heavy and light chain V region genes with 
repertoires of naturally occurring variants 
(repertoires) of V domain genes obtained from 
unimmunized donors. This technique allows the 
production of antibodies and antibody fragments with 
affinities in the nM range. A strategy for making very 
large phage antibody repertoires has been described by 
Waterhouse et al., Nucl. Acids Res., 21: 2265-2266 
(1993). 

Accordingly, antibodies that bind CC-CKR can be 
obtained by screening antibodies or fragments thereof 
expressed on the surface of bacteriophage in 
combinatorial libraries or in other systems as 
described above with a gp120 monoclonal antibody that 
inhibits gp120 binding to receptor. 

In addition to screening antibodies with a gp120 
antibody, random or combinatorial peptide libraries can 
be screened with either a gp120 antibody or the gp120 
compounds of the invention. Approaches are available 
for identifying peptide ligands from libraries that 
comprise large collections of peptides, ranging from 1 
million to 1 billion different sequences, which can be 
screened using monoclonal antibodies or target 
molecules. The power of this technology stems from the 
chemical diversity of the amino acids coupled with the 
large number of sequences in a library. See, for 
example, Scott et al., Curr. Opin. Biotechnol. 5(1):40-8 
(1994); Kenan et al. Trends Biochem. Sci. (1994) 
19(2):57-64. Accordingly, the monoclonal antibodies, 
preferably human monoclonals or fragments thereof, 
generated as discussed herein, find use in treatment by 
inhibiting or treating HIV infection or disease 
progression, as well as in screening assays to identify 
additional pharmaceuticals. 

Production of gp120 

gp120 for a vaccine can be produced by any 
suitable means, as with prior art HIV gp120 subunit 
vaccines. Recombinantly-produced or chemically 
synthesized gp120 is preferable to gp120 isolated 
directly from HIV for safety reasons. Methods for 
recombinant production of gp120 are described below. 

Oligonucleotides encoding gp120 from breakthrough 
isolates and capable of expressing gp120 can be 
prepared by conventional means. For example, the 
nucleotide sequence can be synthesized. Alternatively, 
another HIV nucleotide sequence encoding gp120 can be 
used as a backbone and altered at any differing 
residues as by site-directed mutagenesis. 
Site-directed mutagenesis is described in Kunkel et al., 
Proc. Natl. Acad. Sci. (USA) 82:488-492 (1985) and 
Zoller et al., Nuc. Acids Res. 10:6487-6500 (1982) and 
is well known. 

In a preferred embodiment, the nucleotide sequence 
is present in an expression construct containing DNA 
encoding gp120 under the transcriptional and 
translational control of a promoter for expression of 
the encoded protein. The promoter can be a eukaryotic 
promoter for expression in a mammalian cell. In cases 
where one wishes to expand the promoter or produce 
gp120 in a prokaryotic host, the promoter can be a 
prokaryotic promoter. Usually a strong promoter is 
employed to provide high-level transcription and 
expression. 

The expression construct can be part of a vector 
capable of stable extrachromosomal maintenance in an 
appropriate cellular host or may be integrated into 
host genomes. Normally, markers are provided with the 
expression construct which allow for selection of a 
host containing the construct. The marker can be on 
the same or a different DNA molecule, desirably the 
same DNA molecule. 

The expression construct can be joined to a 
replication system recognized by the intended host 
cell. Various replication systems include viral 
replication systems such as those from retroviruses, 
simian virus, bovine papilloma virus, or the like. In 
addition, the construct may be joined to an amplifiable 
gene, e.g., the DHFR gene, so that multiple copies of 
the gp120 DNA can be made. Introduction of the 
construct into the host will vary depending on the 
construct and can be achieved by any convenient means. 
A wide variety of prokaryotic and eukaryotic hosts can 
be employed for expression of the proteins. 

Preferably, the gp120 is expressed in mammalian 
cells that provide the same glycosylation and disulfide 
bonds as in native gp120. Expression of gp120 and 
fragments of gp120 in mammalian cells as fusion 
proteins incorporating N-terminal sequences of Herpes 
Simplex Virus Type 1 (HSV-1) glycoprotein D (gD-1) is 
described in Lasky, L. A. et al., 1986 (Neutralization 
of the AIDS retrovirus by antibodies to a recombinant 
envelope glycoprotein) Science 233: 209-212 and Haffar, 
O.K. et al., 1991 (The cytoplasmic tail of HIV-1 gp160 
contains regions that associate with cellular 
membranes.) Virol. 180:439-441, respectively. A 
preferred method for expressing gp120 is described in 
the examples. In the examples, a heterologous signal 
sequence was used for convenient expression of the 
protein. However, the protein can also be expressed 
using the native signal sequence. 

An isolated, purified gp120 polypeptide having one 
of the amino acid sequences illustrated in Table 1 can 
be produced by conventional methods. For example, the 
proteins can be chemically synthesized. In a preferred 
embodiment, the proteins are expressed in mammalian 
cells using an expression construct of this invention. 
The expressed proteins can be purified by conventional 
means. A preferred purification procedure is described 
in the examples. 



gp120 Fragments 

The present invention also provides gp120 
fragments that are suitable for use in inducing 
antibodies for use in a vaccine formulation. A 
truncated gp120 sequence, as used herein, is a fragment 
of gp120 that is free from a portion of the intact 
gp120 sequence beginning at either the amino or carboxy 
terminus of gp120. A truncated gp120 sequence of this 
invention is free from the C5 domain. The C5 domain of 
gp120 is a major immunogenic site of the molecule. 
However, antibodies to the region do not neutralize 
virus. Therefore, elimination of this portion of gp120 
from immunogens used to induce antibodies for 
serotyping is advantageous. 

In another embodiment, the truncated gp120 
sequence is additionally free from the carboxy terminal 
region through about amino acid residue 453 of the 
gp120 V5 domain. The portion of the V5 domain 
remaining in the sequence provides a convenient 
restriction site for preparation of expression 
constructs. However, a truncated gp120 sequence that 
is free from the entire gp120 V5 domain is also 
suitable for use in inducing antibodies. 

In addition, portions of the amino terminus of 
gp120 can also be eliminated from the truncated gp120 
sequence. In particular, the truncated gp120 sequence 
can be free from the gp120 signal sequence. The 
truncated gp120 sequence can be free from the amino 
terminus through amino acid residue 111 of the gp120 C1 
domain, eliminating most of the C1 domain but 
preserving a convenient restriction site. However, the 
portion of the C1 domain through the V2 cysteine 
residue that forms a disulfide bond can additionally be 
removed, so that the truncated gp120 sequence is free 
from the amino terminus through amino acid residue 
117 of the gp120 C1 domain. In a preferred embodiment, 
the truncated gp120 sequence is free from the amino 
terminus of gp120 through residue 111 of the C1 domain 
and residue 453 through the carboxy terminus of gp120. 
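
The residue arithmetic of the preferred truncation can be sketched in Python as follows. This is a hypothetical helper, not part of the described method. It assumes that the input is indexed with the same 1-based residue numbering that the text uses for positions 111 and 453, which the passage itself does not define.

```python
def truncate_gp120(residues, n_term_removed_through=111, c_term_removed_from=453):
    """Drop residues 1..n_term_removed_through and c_term_removed_from..end.

    `residues` is any sequence (string or list) whose first element is
    residue 1 in the numbering assumed above. With the defaults, residues
    112 through 452 are kept.
    """
    if not 0 <= n_term_removed_through < c_term_removed_from - 1:
        raise ValueError("truncation boundaries leave no residues")
    if len(residues) < c_term_removed_from:
        raise ValueError("sequence is shorter than the C-terminal boundary")
    # Python slices are 0-based and end-exclusive, so residue 112 sits at
    # index 111 and residue 452 sits at index 451.
    return residues[n_term_removed_through:c_term_removed_from - 1]


# Placeholder input: residue numbers stand in for a real amino acid sequence.
numbered = list(range(1, 501))
kept = truncate_gp120(numbered)
```

With the default boundaries the retained fragment spans 341 positions. Passing `n_term_removed_through=117` models the alternative in which the C1 region through residue 117 is also removed.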

The truncated gp120 sequences can be produced by 
recombinant engineering, as described previously. 
Conveniently, DNA encoding the truncated gp120 sequence 
is joined to a heterologous DNA sequence encoding a 
signal sequence. 

It is understood that the application of the 
teachings of the present invention to a specific 
problem or situation is within the capabilities of one 
having ordinary skill in the art in light of the 
teachings contained herein. Examples of the products 
of the present invention and representative processes 
for their isolation, use, and manufacture appear below, 
but should not be construed to limit the invention. 
All literature citations herein are expressly 
incorporated by reference. 

EXAMPLES 

Materials and Methods 

Specimen collection from human volunteers. Blood 
was collected from MN-rgp120-immunized individuals who 
were infected with HIV-1 while participating in Phase I 
(NIH Protocol AVEG 016) and Phase II (NIH Protocol 
AVEG 201) HIV-1 vaccine trials sponsored by the 
National Institutes of Health (NIH). The demographics 
of the subjects in the study, and the study design have 
been described in McElrath; Seminars in Cancer Biol. 
6:1-11 (1995); McElrath et al.; Abstracts from Eighth 
Annual Meeting of the National Cooperative Vaccine 
Development Groups for AIDS. Bethesda, MD 21S (1996). 
Specimens were obtained according to an informed 
consent protocol approved by the institutional review 
boards of the participating institutions. In the 
experimental section, the time of HIV-1 infection is 
specified with regard to data provided by the NIH AIDS 
Vaccine Evaluation Network where PCR (RNA) and/or 
serologic assays were used to detect HIV-1 infection. 



Sample preparation for cloning HIV-1 envelope 
glycoproteins. Peripheral blood mononuclear cells 
(PBMCs) from HIV-1 infected vaccinees were prepared 
from heparinized venous blood by FICOLL-HYPAQUE 
gradient centrifugation. Cell number and viability 
were determined. After separation, PBMCs were washed 
twice in phosphate-buffered saline and suspended at a 
cell density of 6x10* cells/ml in PCR lysis buffer 
(50 mM KCl, 10 mM Tris (pH 8.4), 2.5 mM MgCl2, 0.1 mg/ml 
gelatin (Sigma), 0.45% NONIDET P40 detergent, 
0.45% TWEEN 20 detergent (both detergents are 
commercially available from United States Biochemical 
Corp.) and 0.06 mg/ml Proteinase K (Gibco BRL)) to lyse 
the cells. The lysate was incubated at 50-60°C for 
1 hour, followed by inactivation of the Proteinase K at 
95°C for 10 minutes. Lysates were shipped frozen and 
stored at -70°C until use. 
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Polymerase chain reaction (PGR) amplificati n. 

Samples were subjected to two rounds of PGR 
amplification using the nested primers described below. 
In the first round, 25 pi aliquots of PBMC lysates 
5 (containing about 1 /xg genomic DNA) were mixed with an 
equal volume of a PCR reaction mix containing 400 /xM 
each dNTP, 200 /xg/ml BSA (Sigma Chemical Corporation, 
RIA grade) and about 100 pmoles of each primer in 50 mM 
KC1, 20 mM Tris (pH 8.4) and 3 mM MgCl,. After an 

10 initial 10 minute denaturation step at 95°C / 5 units of 
Taq polymerase ( AMPLITAQ, Perkin Elmer Cetus) were 
added during an 55°C soak step, and samples were 
overlayed with mineral oil. 
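
As a rough consistency check on the quantities above, the genomic DNA content of a 25 µl lysate aliquot can be estimated from the cell density. The sketch below is illustrative only: it takes the cell density as about 6 x 10^6 cells/ml and assumes roughly 6.6 pg of genomic DNA per diploid human cell, a commonly quoted approximation that is not stated in the text. The function name is hypothetical.

```python
# Assumption (not from the text): approximate DNA content of one diploid human cell.
PG_DNA_PER_DIPLOID_CELL = 6.6


def genomic_dna_ug(cells_per_ml: float, aliquot_ul: float) -> float:
    """Estimate micrograms of genomic DNA contained in a lysate aliquot."""
    cells = cells_per_ml * aliquot_ul / 1000.0      # convert µl to ml
    return cells * PG_DNA_PER_DIPLOID_CELL / 1e6    # convert pg to µg


if __name__ == "__main__":
    # 25 µl of lysate at an assumed 6 x 10^6 cells/ml
    print(f"{genomic_dna_ug(6e6, 25):.2f} µg")
```

Under these assumptions the estimate comes out close to the "about 1 µg" quoted for each first-round reaction.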

The PCR profile was as follows: 2 cycles having
1 minute at 55°C, 2.5 minutes at 72°C and 1 minute at
94°C, followed by 28 cycles with 30 seconds at 55°C,
2.5 minutes at 72°C and 45 seconds at 94°C, and an
extension step at 72°C for 5 minutes.
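
The cycling profile can be written down as a small data structure, which makes the cycle count and total programmed hold time easy to check. This is a minimal sketch; the class and function names are hypothetical, and ramp times between temperatures (instrument dependent) are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float
    minutes: float


@dataclass(frozen=True)
class Stage:
    cycles: int
    steps: tuple


# Profile as described in the text (the initial 10 minute denaturation is not included).
PROFILE = (
    Stage(2, (Step(55, 1.0), Step(72, 2.5), Step(94, 1.0))),
    Stage(28, (Step(55, 0.5), Step(72, 2.5), Step(94, 0.75))),
    Stage(1, (Step(72, 5.0),)),  # final extension
)


def amplification_cycles(profile) -> int:
    """Count cycles in multi-step stages; the single-step extension is excluded."""
    return sum(stage.cycles for stage in profile if len(stage.steps) > 1)


def hold_minutes(profile) -> float:
    """Total programmed hold time in minutes, ignoring temperature ramps."""
    return sum(stage.cycles * sum(s.minutes for s in stage.steps) for stage in profile)


if __name__ == "__main__":
    print(amplification_cycles(PROFILE), "cycles,", hold_minutes(PROFILE), "min")
```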

Aliquots of 10 µl from the first-round reactions
were re-amplified with appropriate nested primers in a
final reaction volume of 100 µl, using either the
reagents and profile described above or the reagents
and profile described in the PCR Optimizer Kit
(Invitrogen). PCR reaction products were purified
using QIAQUICK-spin columns (Qiagen Inc.). The primer
pair used in the first round was either 120.os.F
(5'-gggaattcggatccAGAGCAGAAGACAGTGGCAATGA with
homologous sequence at position 6248-6270 of HIVPV22)
(SEQ. ID. NO. 34) or JM11A
(5'-ctcgag-CTCCTGAAGACAGTCAGACTCATCAAG at position
6048-6074) (SEQ. ID. NO. 35) in the forward direction
[Kusumi et al.; J. Virol. 66:875 (1992)] combined with
120.os.R (5'-ggtctagaagctttaGCCCATAGTGCTTCCTGCTGCTCC
at position 7836-7859) (SEQ. ID. NO. 36) in the reverse
direction. The internal nested primers were 120.BX.F
(5'-gggcggatcctcgaGGTACCTGTRTGGAAAGAAGCA at position
6389-6410; R: A or G) (SEQ. ID. NO. 37) and 120.is.R
(5'-ggtctagaagctttaTGCTCCYAAGAACCCAAGGAACA at position
7819-7841; Y: T or C) (SEQ. ID. NO. 38). Heterologous
primer sequences are shown in lower case letters.
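
The primer notation can be checked mechanically: the upper-case (homologous) portion of each primer should have the same length as the quoted position range, and the degenerate codes R and Y each expand to two concrete sequences. The sketch below is illustrative; the primer strings are transcribed from the paragraph above with spaces and hyphens dropped, and the helper names are hypothetical.

```python
from itertools import product

# Subset of IUPAC nucleotide codes needed for these primers.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT"}

# name -> (primer 5'->3', first position, last position) as quoted in the text
PRIMERS = {
    "120.os.F": ("gggaattcggatccAGAGCAGAAGACAGTGGCAATGA", 6248, 6270),
    "JM11A": ("ctcgagCTCCTGAAGACAGTCAGACTCATCAAG", 6048, 6074),
    "120.os.R": ("ggtctagaagctttaGCCCATAGTGCTTCCTGCTGCTCC", 7836, 7859),
    "120.BX.F": ("gggcggatcctcgaGGTACCTGTRTGGAAAGAAGCA", 6389, 6410),
    "120.is.R": ("ggtctagaagctttaTGCTCCYAAGAACCCAAGGAACA", 7819, 7841),
}


def homologous_part(primer: str) -> str:
    """Return the upper-case portion, i.e. the part homologous to HIV-1."""
    return "".join(ch for ch in primer if ch.isupper())


def expand(seq: str) -> list[str]:
    """Expand degenerate codes into every concrete sequence they stand for."""
    return ["".join(p) for p in product(*(IUPAC[base] for base in seq))]


if __name__ == "__main__":
    for name, (seq, start, end) in PRIMERS.items():
        core = homologous_part(seq)
        print(name, len(core), end - start + 1, len(expand(core)))
```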

Subcloning of PCR products and the expression of
recombinant envelope glycoproteins as fusion proteins.
The HIV-1 envelope glycoprotein gp120 sequences were
cloned and expressed as chimeric genes and fusion
proteins, where the signal sequence and 27 amino acids
from the mature N terminus of herpes simplex virus
type 1 (HSV-1) were fused to the N-terminal sequences
of the gp120 genes, corresponding to amino acid 13 of
the mature gp120 sequence. PCR products containing
gp120 sequences from the breakthrough specimens were
cloned into the pRK5 expression plasmid as chimeric
genes using combinations of restriction sites
engineered into the heterologous PCR primer tails and
the Xho I site engineered into the N-terminal sequence
of HSV-1 gD.

The resulting double-stranded DNA was sequenced 
with Sequenase and the dGTP Reagent Kit (United States 
Biochemical Corp.). Sequences from glycoprotein D were 
provided to enhance expression and to provide a flag 
epitope to facilitate protein analysis, as described in 
Berman et al.; J. Virol. 7:4464-9 (1992); Nakamura et
al.; AIDS and Human Retroviruses 8:1875-85 (1992); and 
Nakamura et al.; J. Virol. 67:6179-91 (1993). 

Briefly, isolated DNA fragments generated by the
PCR reaction were ligated into a plasmid (pRK.gD-5,
pRKgDstop) designed to fuse the gp120 fragments, in
frame, to the 5' sequences of the glycoprotein D (gD)
gene of Type 1 Herpes Simplex Virus (gD-1) and the 3'
end to translational stop codons. The fragment of the
gD-1 gene encoded the signal sequence and 25 amino
acids of the mature form of the HSV-1 protein. To allow
for expression in mammalian cells, chimeric gene
fragments were cloned into the pRK5 expression plasmid
(Eaton et al., Biochemistry 291:8343-8347 (1986)) that
contained a polylinker with cloning sites and
translational stop codons located between a
cytomegalovirus promoter and a simian virus 40
polyadenylation site.

The resulting plasmids were transfected into the
293s embryonic human kidney cell line (Graham et al.,
J. Gen. Virol. 36:59-77 (1977)) using a calcium
phosphate technique (Graham et al., Virology 52:456-467
(1973)). Growth conditioned cell culture media was
collected 48 hr after transfection, and the soluble
proteins were detected by ELISA or by specific
radioimmunoprecipitation where metabolically labeled
proteins from cell culture supernatants were resolved
by sodium dodecyl sulfate polyacrylamide gel
electrophoresis (PAGE) and visualized by
autoradiography as described in Berman et al.,
J. Virol. 63:3489-3498 (1989) and Laemmli, Nature
227:680-685 (1970).

Serologic assays. Sera were assayed for
antibodies to rgp120, antibodies to synthetic peptides
corresponding to sequences from the gp120 V3 domain,
and antibodies able to inhibit the binding of
MN-rgp120 to cell surface CD4 using serologic assays
described in Berman et al.; J. Virol. 7:4464-9 (1992);
Nakamura et al.; AIDS and Human Retroviruses 8:1875-85
(1992); and Nakamura et al.; J. Virol. 67:6179-91
(1993). Endpoint titers of antibody binding to gp120
and V3 peptides were determined using three-fold serial
dilutions of sera. The endpoint dilution titer was
defined as the last dilution that produced an optical
density value that was two times higher than the mean
of the optical densities of 1:50 diluted, pooled,
normal human sera. Antibody titers were calculated by
a computer program that interpolated values between
antibody dilutions. The inter-assay coefficient of
variation of positive control standard sera was 35%.
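
The endpoint definition above can be illustrated with a short calculation. The text does not describe the interpolation used by the computer program, so the sketch below assumes linear interpolation of optical density against log10(dilution) between the two dilutions that bracket the cutoff; the function name and the example readings are hypothetical.

```python
import math


def endpoint_titer(dilutions, ods, normal_ods, factor=2.0):
    """Interpolated reciprocal dilution at which OD falls to the cutoff.

    dilutions  -- reciprocal dilutions in increasing order (e.g. 50, 150, 450)
    ods        -- optical density measured at each dilution
    normal_ods -- ODs of 1:50 diluted normal human sera (negative control)
    Assumption: OD is interpolated linearly against log10(dilution).
    """
    cutoff = factor * sum(normal_ods) / len(normal_ods)
    pairs = list(zip(dilutions, ods))
    if pairs[0][1] < cutoff:
        return None  # below the cutoff even at the lowest dilution
    for (d_hi, od_hi), (d_lo, od_lo) in zip(pairs, pairs[1:]):
        if od_hi >= cutoff > od_lo:
            frac = (od_hi - cutoff) / (od_hi - od_lo)
            log_d = math.log10(d_hi) + frac * (math.log10(d_lo) - math.log10(d_hi))
            return 10 ** log_d
    return float(dilutions[-1])  # still above the cutoff at the last dilution


if __name__ == "__main__":
    # Hypothetical readings for a three-fold dilution series starting at 1:50.
    titer = endpoint_titer([50, 150, 450, 1350], [1.2, 0.8, 0.4, 0.1], [0.1, 0.1])
    print(f"endpoint titer about 1:{titer:.0f}")
```

With interpolation the reported titer falls between the last dilution above the cutoff and the first dilution below it, rather than being restricted to the tested three-fold steps.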

Binding of monoclonal antibodies to rgp120 from
breakthrough viruses. An ELISA similar to that
described by Moore et al.; AIDS 3:155-63 (1989) was
used to measure the binding of various monoclonal
antibodies (MAbs) to rgp120s from breakthrough viruses.
Briefly, Nunc-Immuno plates (Maxisorp, certified) were
coated (100 µl at 5 µg/ml in PBS at 4°C overnight) with
an affinity-purified sheep polyclonal antiserum to a
peptide at the C terminus of gp120 (D7324,
International Enzymes, Fallbrook, CA). After washing
once with PBS-0.05% TWEEN-20 detergent, the plates were
blocked with PBS-1.0% BSA for 30-60 minutes at room
temperature. Cell culture supernatants from 293s
cells, diluted to contain equivalent amounts of the
gD-rgp120 fusion protein, were added and incubated for
2 hours at room temperature followed by three washes
with PBS-0.05% TWEEN-20 detergent. Various MAbs were
diluted in PBS-1.0% BSA, and 100 µl of the diluted MAbs
were added to each well and incubated for 1 hour at
room temperature.

The plates were washed 3 times and incubated with
100 µl of a horseradish peroxidase-conjugated second
antibody (goat anti-mouse or anti-human IgG, Cappel)
for 1 hour at room temperature. After 3 washes the
plates were developed and the OD492 (optical density at
492 nm) read in a plate reader. Growth conditioned
cell culture supernatants were normalized by dilution
based on binding by MAb 5B6, which is specific for the
HSV-1 glycoprotein D fusion protein.

Virus neutralization assays. The ability of
vaccinee sera to inhibit infection of MT4 cells by
HIV-1MN was measured in a cytopathicity assay where cell
viability was quantitated using a colorimetric
indicator dye, as described in Robertson et al.;
J. Virol. Methods 20:195-202 (1988). Briefly, a virus
stock of HIV-1MN (obtained from Dr. Michael Norcross,
U.S. Food and Drug Administration) was prepared as the
clarified supernatant from chronically infected
H9/HIV-1MN cell culture. H9 cells chronically infected
with HIV-1MN were pelleted and resuspended in one-tenth
the original volume of medium. Cell-associated virus
was released by the mechanical shearing effects of
rapid vortexing of the cells as described in Wrin et
al.; J. Virol. 69:39-48 (1995).

An amount of virus sufficient to ensure complete
cell killing in 7 days was incubated with
three-fold serial dilutions of test antisera, and then
used to challenge MT4 T-lymphoid cells in 10%
FCS/RPMI-1640 cell culture media. The cultures were
incubated for 7 days at 37°C in 5% CO2, and then cell
viability was tested by the dye MTT, as described by
Robertson et al.; J. Virol. Methods 20:195-202 (1988).
Virus neutralization endpoints were quantitated by
measurement of OD at 570-650 nm, and then the endpoint
titers were calculated as the reciprocal of the
antiserum dilution giving a signal that was two-fold
above the control signal with unprotected (killed)
cells. These titers were typically twice those
calculated at 50% protection.

Results 

Immunization history of infected subjects. Since
1992, 499 adults have been immunized with MN-rgp120 in
Phase I trials in low or moderate risk individuals and
in a Phase II clinical trial involving moderate to high
risk individuals. The studies described herein entail
the genetic and immunologic characterization of the
first seven of nine individuals who became infected
with HIV-1 through high risk behavior during the course
of these trials. A listing of the trials and summary
of the status of the vaccinees is presented in
Table 2A. A listing of the analysis of the vaccinees
is presented in Table 2B.



TABLE 2A

Description of Vaccinees Infected with HIV-1
After Immunization with MN-rgp120

Study No.   Case No.   Risk Group*   Antigen dose/Adjuvant†
016         C6         M/H           300/QS21
016         C8         M/H           600/QS21
016         C15        M/H           300/QS21
201         C7         M/H           600/Alum
201         C11        M/H           600/Alum
201         C10        M/IDU         600/Alum
201         C17        M/IDU         600/Alum

* - M/H indicates male homosexual; M/IDU indicates
male intravenous drug user.
† - numbers indicate dose in micrograms of
MN-rgp120 injected per immunization; QS21
indicates antigen was formulated in QS21 adjuvant;
Alum indicates MN-rgp120 formulated in aluminum
hydroxide.

TABLE 2B

Description of Vaccinees Infected with HIV-1
After Immunization with MN-rgp120

Case   Injection          Injections      Time of     Interval(a)
No.    (months)           before HIV-1+   HIV-1+      to HIV-1+
                                          (months)    (months)
C6     0, 1, [illegible]  2               4.00        2.00
C8     0, 1               2               4.00        3.00
C15    0, 1, 2            3               6.25        4.00
C7     0, 1, 6, 12        3               9.25        3.00
C11    0, 1, 6, 12        4               19.50       6.75
C10    0, 1, 6, 19        3               19.50       13.50
C17    0, 1, 6, 18        4               24.75       6.25

a - indicates interval between last immunization
and detection of HIV-1 infection.





Three of the infections occurred in a Phase I
trial (NIH Protocol AVEG 016) that compared the safety
and immunogenicity of MN-rgp120 formulated in two
different adjuvants (alum and QS21), and four of the
infections occurred in a Phase II trial aimed at
establishing the safety and immunogenicity of MN-rgp120
in various high risk groups (e.g., intravenous drug
users, homosexual and bisexual males, and partners of
HIV-1 infected individuals).

Of the seven infections studied (Table 3), two (C6
and C8) occurred after two injections, three (C7, C10
and C15) occurred after three injections, and two (C11
and C17) occurred after receiving the four scheduled
injections. The interval between receiving the last
immunization and becoming infected was 2 to
13.5 months.

TABLE 3

Peak Post Boost MN-rgp120 Antibody Titers
in Vaccinees that Became Infected with HIV-1

Injections   C6      C8      C15    C7     C11     C10     C17
1            <50     2185    79     <50    2890    na      na
2            21539   10125   na     413    32696   7771    7056
3            #       #       4460   9707   34728   11627   18413
4            #       #       #      #      #       #       11340

# - indicates specimen not analyzed because of
HIV-1 infection.
na - indicates the sample was not available for
testing.
boldface - indicates unusually low antibody
titers.



Antibody response to gp120 in vaccinated
individuals. The magnitude and specificity of the
antibody response to MN-rgp120 was measured by ELISA in
all infected individuals throughout the course of the
immunization regime (Figure 1). Five of the seven
subjects exhibited normal antibody response kinetics
that included a small but reproducible primary response
(1:100-1:2,000), a strong secondary (booster)
response (titers ranging from 1:7,000-1:32,000), and
antibody responses following third and fourth
injections that were similar to or marginally higher
than those achieved after the second immunization
(Figure 1, Table 3).

The antibody response observed in C7 (Figure 1C)
was unusual in that no antibodies were detectable after
the primary injection and a titer of only 1:350 was
detected after the second injection. It thus appeared
that C7 did not respond to the primary immunization,
and that the antibody response obtained after the
second injection represented a primary immune response.
Consistent with this hypothesis, the third injection
elicited a titer of only 1:9,707, typical of those
normally seen after two immunizations.

An atypical antibody response was also seen in
subject C15 (Figure 1G), who was immunized according to
an accelerated immunization schedule of 0, 1, and 2
months. As expected, the antibody titer seen in this
subject (1:4,460) was at the low end of what is
typically achieved after two immunizations and was far
below normal values for three immunizations. The lack
of an effective booster response after the third
immunization of C15 was not surprising in view of
previous studies where an accelerated 0, 1, and 2 month
immunization schedule in baboons [Anderson et al.;
J. Infect. Dis. 160:960-9 (1989)] similarly prolonged
the secondary response and failed to elicit an
effective tertiary booster response.

Retrospective analysis of serum and plasma from
subjects C6 (Figure 1A) and C8 (Figure 1B) indicated
that they became infected with HIV-1 at some point
between the second and third immunizations. Serologic
evidence of HIV-1 infection was evident in the gp120
antibody assays where the titers failed to decline two
weeks after the second injection and instead formed an
uncharacteristic high titer plateau (Figures 1A and
1B). A similar plateau in MN-rgp120 titer after the
third injection suggested that subject C7 became
infected around week 36, approximately 16 weeks after
receiving the third injection (Figure 1C). Subjects
C10 (Figure 1E), C11 (Figure 1D), C15 (Figure 1G), and
C17 (Figure 1F) developed unexpected increases in gp120
titers, typical of HIV-1 infection, after either the
third or fourth immunizations. The data obtained
demonstrate that immunologic priming for MN-rgp120
antibody responses is insufficient to provide universal
protection from HIV-1 infection.

Antibody titers to the V3 domain. To further
characterize the antibody response to gp120, antibody
titers were measured to a synthetic V3 domain peptide
of MN-rgp120 containing the principal neutralizing
determinant (PND). Five of the seven subjects
developed good V3 titers (1:400 to 1:4000) after the
second immunization; however, two subjects (C7 and C15)
required three immunizations before developing
significant titers (Figures 1C and 1G). As had been
observed previously (11), the peak V3 titers in some
individuals (e.g. C11, C10, C17) appeared to decline
with each successive immunization (Figures 1D, 1E, and
1F). After HIV-1 infection, two patterns of V3
reactivity were observed. Three subjects (C6, C7, and
C10) showed large increases in titer to V3 domain
peptides (Figures 1A, 1C, and 1E) whereas C8
(Figure 1B) showed a large decrease in V3 titer. At
the time of analysis, the data were insufficient to
draw any conclusions regarding the changes in V3 titers
in response to HIV-1 infection in subjects C11, C15 and
C17.

The results obtained indicate that the ability to
form antibodies reactive with the V3 domain at various
time-points prior to HIV-1 infection is not a valid
correlate of protective immunity against all strains of
HIV-1.

CD4 Inhibition titers. Antibodies that block the
binding of gp120 to CD4 represent a heterogeneous class
of virus neutralizing antibodies. Some are known to
bind to the C4 domain of gp120 [Nakamura et al.;
J. Virol. 67:6179-91 (1993); Anderson et al.; J.
Infect. Dis. 160:960-9 (1989)], and some are known to
recognize conformation dependent discontinuous epitopes
[Berman et al.; J. Virol. 7:4464-9 (1992);
Nakamura et al.; J. Virol. 67:6179-91 (1993);
McKeating et al.; AIDS Research and Human Retroviruses
8:451-9 (1992); Ho et al.; J. Virol. 65:489-93 (1991);
Barbas et al.; Proc. Natl. Acad. Sci. USA 91:3809-13
(1994)].

One way to detect antibodies to both types of
epitopes is to measure the ability of vaccinee sera to
prevent the binding of [125I]-labeled gp120 to cell
surface CD4 [Nakamura et al.; AIDS and Human
Retroviruses 8:1875-85 (1992); Nakamura et al.;
J. Virol. 67:6179-91 (1993)]. CD4 blocking titers were
detected in all seven of the vaccinees prior to
infection (Figure 2) with peak titers that ranged from
1:10-1:300. At the last time point prior to infection,
the CD4 titers in five of the seven vaccinees were low
(1:30 or less). One vaccinee (C17), however, possessed
a CD4 blocking titer of about 1:300 prior to infection
(Figure 2F). Thus, the lack of antibodies that block
the binding of MN-rgp120 to CD4 cannot account for all
of the infections. Large increases in CD4 blocking
titers (1:100-1:1,000) were seen in five of the seven
subjects after HIV-1 infection. These included
vaccinees C6, C7, C8, C10, and C11. These results
demonstrate that the CD4 blocking titers elicited by
MN-rgp120 were lower than those elicited by natural
infection.

Virus neutralizing activity. The virus
neutralizing activity of antisera from
MN-rgp120-immunized subjects was measured using a
colorimetric assay that measured the viability of MT-4
cells after incubation with antibody treated virus
(HIV-1MN). Since the actual date of infection was not
known for any of the breakthrough infections, and serum
samples were collected infrequently, the magnitude of
the neutralizing antibody response at the time of
infection is not known for any of the vaccinees.

Of the seven infections examined, the serum sample
closest to the time of infection was that obtained from
C7, where a neutralizing titer of 1:15 to HIV-1MN was
present three weeks prior to detection of HIV-1
infection (Table 4). In all other cases, however, the
interval between the last injection and the time of
infection was 10 to 25 weeks.

TABLE 4

Neutralization Activity of Sera from Vaccinees
Infected with HIV-1

Week    C6      C8      C15     C7      C11     C10
0       <10*    <10*    <10*    <10*    <10*    <10*
2       <10     <10     <10
4       <10*    <10*    nd*     <10*    <10*    <10*
6       10      80              <10     30      150
8                       nd*     -
10                      35                      -
15                              <10
16      150#    250#                    30      10
24                      150#    <10*    20*     <10*
26                              70      500     200
30                              -               40
33                              15
35                                      100
36                              30#     -       10
52                                      30*     <10
54                                      250     -
57                                      100     -
63                                      90
64
77                                      40#
78                                              500#
80
84

C17 (values for weeks 0 to 84 in the order printed; the
week alignment is not recoverable from this copy):
<10, <10*, 400, 100, 40, <10, <10, 10*, 100, 60

Week    C17
90      150
104     150#

* - indicates immunization.
# - indicates HIV-1 positive.
nd - indicates not done.
- - indicates sample not available.



When sera from the two early infections were
examined (Table 4), one individual (C6) had a peak
neutralizing titer of 1:10 ten weeks prior to detection
of HIV-1 infection, whereas the other individual (C8)
had a neutralizing titer of 1:80 ten weeks prior to
detection of HIV-1 infection. Subject C15, who was
immunized according to an accelerated immunization
schedule, developed a neutralizing titer of 1:35 after
the third injection, 14 weeks prior to HIV-1 infection.
Subject C10, who had a peak neutralizing titer of 1:200
following the third immunization (week 24), had no
detectable titer at week 52, six months prior to the
first indication of HIV-1 infection (week 78).

Subject C11 possessed a neutralizing titer of 1:90
at fourteen weeks prior to detection of HIV-1 and a
peak titer of 1:500 following the third immunization.
Similarly, vaccinee C17 had a neutralizing titer of
1:150 fourteen weeks prior to infection and a peak
titer of 1:400 at two weeks after the third
immunization.

Based on the rate of decay of the gp120 response
of approximately two months [Belshe et al.; JAMA
272(6):475-80 (1994)], as well as the observation that
neutralizing titers of 1:150 decayed to 1:10 in 10
weeks in vaccinees C10 and C17, it appears that
neutralizing titers in C8, C15, C11, and C17 could have
declined to 1:10 or less in the intervals between the
last pre-infection serum sample and the time of HIV-1
detection.

The results of these studies demonstrated that all
vaccinees developed some level of virus-neutralizing
antibodies at some time prior to HIV-1 infection, and
that the magnitude of the neutralizing response was
probably low at the time of infection. In general, the
magnitude of the virus-neutralizing response observed
in the individuals that became infected with HIV-1 was
comparable to that seen in non-infected vaccinees as
described in Belshe et al.; JAMA 272(6):475-80 (1994).
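
The decay argument made above for C8, C15, C11 and C17 can be made concrete with a small calculation. The sketch below assumes simple first-order (exponential) decay of the neutralizing titer, which the text does not state explicitly, and derives the decay rate from the quoted observation that a titer of 1:150 fell to 1:10 in 10 weeks. The function names are hypothetical; the starting titers and intervals are those quoted in the text.

```python
import math


def half_life_weeks(start_titer: float, end_titer: float, weeks: float) -> float:
    """Half-life implied by a fall from start_titer to end_titer over `weeks`.

    Assumption: first-order (exponential) decay.
    """
    return weeks * math.log(2) / math.log(start_titer / end_titer)


def weeks_to_reach(start_titer: float, target_titer: float, half_life: float) -> float:
    """Weeks needed for a titer to decay from start_titer to target_titer."""
    return half_life * math.log2(start_titer / target_titer)


if __name__ == "__main__":
    hl = half_life_weeks(150, 10, 10)
    print(f"implied half-life: {hl:.2f} weeks")
    # (last pre-infection titer, weeks before detection) as quoted in the text
    cases = {"C8": (80, 10), "C15": (35, 14), "C11": (90, 14), "C17": (150, 14)}
    for subject, (titer, interval) in cases.items():
        needed = weeks_to_reach(titer, 10, hl)
        print(f"{subject}: about {needed:.1f} weeks to reach 1:10 (interval {interval} weeks)")
```

Under this assumed model each of the four titers would reach 1:10 within the interval that preceded detection of infection, which is consistent with the conclusion drawn in the text.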

Sequences of Viruses. To evaluate the similarity
of the breakthrough viruses with the vaccine antigen,
nucleotide sequences for gp120 from all seven
breakthrough viruses were determined. Envelope
glycoprotein genes were amplified from proviral DNA
using the polymerase chain reaction. Sequences were
obtained by direct amplification of DNA from lysates of
gradient-purified lymphocytes obtained directly from
patient blood without any intermediate tissue culture
or amplification step.

A listing of the complete gp120 sequences (two
clones per specimen) is provided in Figure 3. All
seven envelope glycoproteins possessed sequences
typical of subtype (clade) B viruses. The overall
homology with MN-rgp120 ranged from 69-80% (Table 5).

TABLE 5

Comparison of MN-rgp120 Sequence with Sequences
from Infected Vaccinees*

        MN     C6.1   C8.3   C7.2   C11.5  C10.5  C17.1  C15.2
MN      100    79     78     70     75     69     80     72
C6.1           100    78     70     81     75     90     79
C8.3                  100    68     80     76     84     83
C7.2                         100    80     73     76     73
C11.5                               100    75     70     80
C10.5                                      100    70     72
C17.1                                             100    87
C15.2                                                    100

* - Data indicate percent identity.







Interestingly, a high percentage (four of seven) of the
breakthrough viruses differed from MN-rgp120 by 25-30%
[Myers et al.; Retroviruses and AIDS Database, Los
Alamos National Laboratory (1992 and 1995)].
Historically this degree of sequence variation is
typical of inter-subtype (inter-clade) variation rather
than intra-subtype variation, which is expected to be in
the 10-20% range [Myers et al.; Retroviruses and AIDS
Database, Los Alamos National Laboratory (1992 and
1995)]. Of the viruses with the greatest homology to
MN-rgp120, two (C6 and C8) occurred as early
infections, prior to complete immunization, and one
(C17) occurred as a late infection.
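
The divergence figures discussed above follow directly from the MN row of Table 5, since percent divergence is 100 minus percent identity. The sketch below is illustrative only: the identity values are transcribed from Table 5 under the assumed column order C6.1, C8.3, C7.2, C11.5, C10.5, C17.1, C15.2, and the 25% threshold is taken from the text.

```python
# Percent identity to MN-rgp120 (MN row of Table 5; column order is an assumption).
IDENTITY_TO_MN = {
    "C6.1": 79, "C8.3": 78, "C7.2": 70, "C11.5": 75,
    "C10.5": 69, "C17.1": 80, "C15.2": 72,
}


def divergence(identity_percent: float) -> float:
    """Percent divergence is the complement of percent identity."""
    return 100 - identity_percent


def highly_divergent(identities: dict, threshold: float = 25) -> list[str]:
    """Clones whose divergence from MN-rgp120 is at least `threshold` percent."""
    return sorted(name for name, ident in identities.items()
                  if divergence(ident) >= threshold)


if __name__ == "__main__":
    for name, ident in IDENTITY_TO_MN.items():
        print(f"{name}: {divergence(ident)}% divergent")
    print("25% or more:", highly_divergent(IDENTITY_TO_MN))
```

The three clones below the threshold (C6.1, C8.3 and C17.1) are the ones the text identifies as having the greatest homology to MN-rgp120.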

Polymorphism in the V3 Domain. Of particular
interest were polymorphisms in regions known to contain
epitopes recognized by virus neutralizing antibodies.
The best characterized neutralizing epitope, the
principal neutralizing determinant (PND), occurs at the
tip of the V3 loop. In subtype B viruses,
approximately 60% possess the MN serotype-defining
signature sequence, IGPGRAF (SEQ. ID. NO. 39), based on
identity with the prototypic MN strain of HIV-1
[Berman et al.; J. Virol. 7:4464-9 (1992);
Myers et al.; Retroviruses and AIDS Database, Los
Alamos National Laboratory (1992 and 1995);
La Rosa et al.; Science 249:932-5 (1990)].

Three of the viruses (C6, C8, and C17) possessed
the MN serotype signature sequence (Figure 3). In
contrast, four viruses possessed sequences with radical
amino acid substitutions in the PND [IGPGRAW (C7),
LGPGSTF (C11), IGPGRVL (C10), and IGPGSAF (C15)]
(SEQ. ID. NOs. 40-43, respectively), and therefore were
classified as "non-MN-like" viruses. Of note, each of
the four "non-MN-like" sequences was rare (Table 6)
and was not typical of the most common "non-MN"
variants of subtype B viruses [Myers et al.;
Retroviruses and AIDS Database, Los Alamos National
Laboratory (1992 and 1995)].

TABLE 6

Frequency of Polymorphisms at the Principal
Neutralizing Determinant in HIV-1 Infected
Individuals Immunized with MN-rgp120*

                 Observed            Dataset Frequency
V3                              GNE       LANL       LANL.1     LaRosa
Sequence     n   Frequency      (n = 52)  (n = 519)  (n = 160)  (n = 245)
GPGRAF       3   0.42           0.67      0.57       0.66       0.60
GPGRAW       1   0.14           0.03      0.013      0.06       0.010
GPGRVL       1   0.14           <0.02     0.004      <0.006     <0.008
GPGSTF**     1   0.14           <0.02     <0.002     <0.006     <0.004
GPGSAF       1   0.14           0.02      0.011      <0.006     <0.004

* - Dataset GNE refers to a collection of
52 independent isolates collected in 1992;
dataset LANL refers to a collection of
519 sequences reported by Myers et al.,
Retroviruses and AIDS Database, Los Alamos
National Laboratory 1992 and 1995; LANL.1 refers
to a collection of 160 epidemiologically unlinked
individuals provided by B. Korber (personal
communication); dataset La Rosa refers to sequence
data reported by La Rosa et al., Science 249:932-5
(1990).

** - Sequences were not present in the data sets
examined.



The prevalence of viruses with PND sequences
matching the breakthrough viruses ranged from a high of
1.3% (C7) to a low of 0.2% (C11) in a listing of 519
subtype B sequences compiled by the Los Alamos National
Laboratory [Myers et al.; Retroviruses and AIDS
Database, Los Alamos National Laboratory (1992 and
1995)]. Similarly low frequencies were observed in
three other independently derived data sets (Table 6).
The occurrence of these sequences did not differ
significantly between data sets collected prior to 1985
[La Rosa et al.; Science 249:932-5 (1990)] and data
collected in 1992, or from a set of 160 epidemiologically
unlinked individuals (B. Korber, personal
communication). All four sets of data agreed that the
prevalence of viruses with MN-like PND sequences was in
the range of 60%. Based on these data, four of the
seven breakthrough infections were determined to be
caused by viruses that fell outside of the spectrum of
viruses that the vaccine was expected to prevent.
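
The classification of breakthrough viruses by their PND sequence, and the observed frequencies reported in Table 6, can be reproduced with a few lines. This is a minimal sketch: the PND sequences are those quoted in the text for each subject, the MN signature is IGPGRAF, and the function names are hypothetical.

```python
from collections import Counter

MN_SIGNATURE = "IGPGRAF"

# PND sequence at the tip of the V3 loop for each breakthrough virus (from the text).
PND = {
    "C6": "IGPGRAF", "C8": "IGPGRAF", "C17": "IGPGRAF",
    "C7": "IGPGRAW", "C11": "LGPGSTF", "C10": "IGPGRVL", "C15": "IGPGSAF",
}


def is_mn_like(pnd: str) -> bool:
    """A virus is treated as MN-like if its PND matches the MN signature exactly."""
    return pnd == MN_SIGNATURE


def observed_frequencies(pnds: dict) -> dict:
    """Fraction of breakthrough viruses carrying each PND sequence."""
    counts = Counter(pnds.values())
    total = len(pnds)
    return {seq: count / total for seq, count in counts.items()}


if __name__ == "__main__":
    mn_like = sorted(s for s, pnd in PND.items() if is_mn_like(pnd))
    non_mn = sorted(s for s, pnd in PND.items() if not is_mn_like(pnd))
    print("MN-like:", mn_like)
    print("non-MN-like:", non_mn)
    for seq, freq in observed_frequencies(PND).items():
        print(f"{seq}: {freq:.2f}")
```

Three of seven viruses (about 43%) carry the MN signature, compared with roughly 60% in the reference data sets, and each non-MN-like sequence accounts for one of seven (about 14%).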

Other features of breakthrough virus V3 domains.
Like MN-rgp120, the V3 domains of all of the
breakthrough viruses were 36 amino acids in length.
However, all seven viruses differed from MN-rgp120 with
respect to the number of glycosylation sites and with
respect to the syncytium-inducing (SI) signature
sequence.

The sequence of MN-rgp120 is somewhat unusual
[Myers et al.; Retroviruses and AIDS Database, Los
Alamos National Laboratory (1992 and 1995)] in that it
lacks an N-linked glycosylation site at position 306 in
the V3 domain. The lack of this glycosylation site
does not appear to be antigenically significant since
antisera to MN-rgp120 are known to neutralize a variety
of viruses (e.g. SF-2, DU6587-5, DU4489-5, CC) that
possess a glycosylation site at this position
[Berman et al.; J. Virol. 7:4464-9 (1992)].

In addition, the V3 domain of MN-rgp120 possessed
sequence polymorphisms (R at position 311, K at
position 324, K at position 323) typical of syncytium
inducing viruses [Fouchier et al.; J. Virol. 66:3183-87
(1992)], whereas all seven breakthrough viruses
possessed sequences associated with non-syncytium-
inducing viruses. Syncytium-inducing viruses have been
associated with rapid disease progression [Tersmette et
al.; J. Virol. 62:2026-32 (1988)] and T cell tropism
[O'Brien et al.; Nature (London) 348:69-73 (1990);
Shioda et al.; Nature (London) 349:167-9 (1991)]. To
date viruses with these properties have not been
recovered from any of the MN-rgp120 immunized
volunteers.

Polymorphism in the V1, V2 and C4 domains.
Previous investigations have identified additional
neutralizing epitopes in the V1, V2 and C4 domains of
gp120 [Nakamura et al.; J. Virol. 67:6179-91 (1993);
McKeating et al.; AIDS Research and Human Retroviruses
8:451-9 (1992); Ho et al.; J. Virol. 65:489-93 (1991);
Barbas et al.; Proc. Natl. Acad. Sci. USA 91:3809-13
(1994); McKeating et al.; J. Virol. 67:4932-44 (1993);
Moore et al.; J. Virol. 67:6136-6151 (1993);
Davis et al.; J. Gen. Virol. 74:2609-17 (1993)].
The best characterized of these neutralizing
epitopes is in the C4 domain, which has attracted
special attention because antibodies binding to this
area are known to block the binding of gp120 to CD4
[Moore et al.; AIDS 3:155-63 (1989); McKeating et al.;
AIDS Research and Human Retroviruses 8:451-9 (1992)].
Because the epitope is located in a conserved (C)
domain, naturally-occurring polymorphism in this region
is far more limited than in other neutralizing
epitopes. Nakamura et al.; J. Virol. 67:6179-91 (1993)
reported that the binding of a number of neutralizing
MAbs was dependent on K at position 429.

Comparison of the sequence of MN-rgp120 with other
strains of HIV-1 showed that a common polymorphism,
involving the substitution of E for K, occurs at this
position. Indeed, substrains of the same virus isolate
often show polymorphism at this position. The HXB2
substrain of HIV-1LAI contains K at position 429,
whereas the BH10, IIIB, and LAV substrains of
HIV-1LAI contain E at this position [Nakamura et al.;
J. Virol. 67:6179-91 (1993)]. Similarly, the 1984
isolate of HIV-1MN exhibited E at this position, while
the 1990 isolate of HIV-1MN, used to produce MN-rgp120,
possessed K at this position.

When the sequences of the infected vaccine
recipients were examined (Figure 3), the virus from
subject C17, like MN-rgp120, contained K at position
429, whereas the six other viruses that differed from
the vaccine immunogen possessed E at this position.
These results demonstrated that six of the seven
breakthrough viruses differed from the vaccine
immunogen at the CD4-blocking, neutralizing epitope in
the C4 domain of gp120.

Studies with monoclonal antibodies have defined
neutralizing epitopes in the V1 and V2 domains of gp120
[McKeating et al.; J. Virol. 67:4932-44 (1993); Moore
et al.; J. Virol. 67:6136-6151 (1993); Davis et al.;
J. Gen. Virol. 74:2609-17 (1993)]. Like the C4 domain,
the V2 domain exhibits several common polymorphisms that
affect the binding of virus neutralizing antibodies.
One such polymorphism occurs at position 171, which is
critically important for the binding of murine
MAb 1025, whereas residue 187 is important for the
binding of several MAbs represented by 1088.
When the V2 domain sequences were examined
(Figure 3), all of the infected-vaccinee viruses
differed from MN-rgp120 in that R replaced G at
position 171 and I or V replaced E at position 187.
Antibodies recognizing these adjacent sites in the
V2 domain of MN-rgp120 would not be expected to
neutralize viruses with radical amino acid
substitutions at these positions. Thus, all seven
breakthrough viruses differed from MN-rgp120 at a
neutralizing epitope in the V2 domain of gp120.

Other neutralizing epitopes have been reported in
the V1 domain of gp120 [O'Brien et al.; Nature (London)
348:69-73 (1990); McKeating et al.; J. Virol.
67:4932-44 (1993)]. Although the neutralizing epitopes
in the V1 domain of MN-rgp120 have not been
characterized, the polymorphism seen among the
breakthrough viruses in this region was noteworthy.
Particularly striking (Figure 3) was that the length of
this domain ranged from 20 amino acids (C17) to
45 amino acids (C6), and the number of N-linked
glycosylation sites ranged from 2 to 6. In contrast,
the V1 domain of MN-rgp120 is 31 amino acids in length
and contains three N-linked glycosylation sites.

Although examination of sequence databases suggests
that variation in the V2 region is comparable to that
in the V1 region, the V2 region of the breakthrough
viruses showed less variation than expected.
Specifically, the length of the V2 region ranged from
36 amino acids (C7) to 39 amino acids, with six of
seven viruses containing three N-linked glycosylation
sites in this domain. A high degree of polymorphism was
found in the V4 region, where sequences ranged from
26 (C10) to 33 (C15, C7) amino acids in length and
contained either 4 or 5 N-linked glycosylation sites.
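
The domain lengths and glycosylation-site counts compared above can be tallied mechanically from one-letter sequences. The following is a minimal sketch, not the method used in these studies. It assumes the conventional N-linked glycosylation sequon Asn-X-Ser/Thr, where X is any residue except proline. The function names and the toy fragment are hypothetical and are not taken from Figure 3.

```python
import re

# Sequon N-X-S/T, where X is any residue except proline (one-letter codes).
# A lookahead is used so that overlapping sequons are each counted.
SEQUON = re.compile(r"(?=N[^P][ST])")


def count_n_glyc_sites(domain_seq: str) -> int:
    """Count potential N-linked glycosylation sites in a one-letter sequence."""
    return len(SEQUON.findall(domain_seq.upper()))


def summarize_domain(domain_seq: str) -> tuple:
    """Return (length in amino acids, number of potential N-linked sites)."""
    return len(domain_seq), count_n_glyc_sites(domain_seq)


if __name__ == "__main__":
    # Hypothetical toy fragment used only to exercise the functions.
    toy_fragment = "CTNWKKNDTKTNSS"
    length, sites = summarize_domain(toy_fragment)
    print(f"length={length} potential N-linked sites={sites}")
```

A sequon only marks a potential site. Whether a given site is actually glycosylated is not something this count can establish.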

Antigenicity of envelope glycoproteins from
breakthrough viruses. To determine the significance of
sequence variation on glycoprotein antigenicity,
recombinant gp120 was prepared from the viruses of all
seven infected vaccinees (Figure 4). In these studies,
a series of MAbs was assembled and their binding to
MN-rgp120 was compared with their binding to rgp120
from the vaccinee isolates by ELISA (Table 7).
TABLE 7

Relative Reactivity of MAb Binding to rgp120 from
Infected Subjects Compared with Binding to MN-rgp120*

Epitope:  V3      V3      Discont.  V2      C4      V2
rgp120    1034    50.1    15e       1025    1024    1088

MN        1.00    1.00    1.00      1.00    1.00    1.00
C6.1      [illegible]     0.17      0.00    0.00    0.00
C6.5      0.33    0.33    0.75      0.00    0.00    0.00
C8.3      0.11    0.37    0.38      0.00    0.00    0.00
C8.6      0.14    0.34    0.29      0.00    0.00    0.00
C7.2      0.47    0.60    0.71      0.00    0.00    0.00
C11.5     0.00    0.00    0.17      0.00    0.00    0.00
C11.7     0.00    0.00    0.17      0.00    0.00    0.00
C10.5     0.33    0.40    0.46      0.24    0.03    0.04
C10.7     0.42    0.48    0.50      0.29    0.07    0.09
C17.1     0.33    0.52    0.33      0.00    0.30    0.07
C17.3     0.37    0.56    0.33      0.00    0.38    0.06
C15.2     0.00    0.47    0.92      0.00    0.00    0.00
C15.3     0.00    0.37    0.63      0.00    0.00    0.00

* Relative reactivity values represent the ratio of
optical densities obtained with rgp120 from
patient isolates divided by the optical density
obtained for MN-rgp120 at a MAb concentration of
2 micrograms per milliliter.
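
The ratio defined in the footnote to Table 7 can be written out as a short calculation. This is a minimal sketch under stated assumptions: the function name is hypothetical, the optical density readings are invented placeholders rather than data from these studies, and rounding to two decimals is assumed only because the table reports two decimals.

```python
def relative_reactivity(od_isolate: float, od_mn: float) -> float:
    """Ratio of the OD for an isolate rgp120 to the OD for MN-rgp120.

    Both readings are assumed to be taken at the same MAb concentration
    (2 micrograms per milliliter in Table 7).
    """
    if od_mn <= 0:
        raise ValueError("MN-rgp120 optical density must be positive")
    return round(od_isolate / od_mn, 2)


if __name__ == "__main__":
    # Placeholder optical densities, for illustration only.
    od_mn_reference = 1.20
    od_by_clone = {"clone_a": 0.30, "clone_b": 0.00}
    for clone, od in od_by_clone.items():
        print(clone, relative_reactivity(od, od_mn_reference))
```

By construction the MN-rgp120 row of such a table is 1.00, and a value of 0.00 indicates no detectable signal for the isolate.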

In control experiments, the binding of MAb 5B6
(which is specific for the HSV gD-1 flag epitope fused
to the N terminus of all of the rgp120 proteins) was
used to standardize the amount of gp120 from each
isolate (Figure 5A). These studies demonstrated that
the assay was carried out under conditions where
equivalent amounts of the rgp120s were captured onto
wells of microtiter plates.

The antigenic structure of the V3 domain was
examined using the 1034 MAb (isolated from mice
immunized with MN-rgp120 as described in
Nakamura et al.; J. Virol. 67:6179-91 (1993)) and the
50.1 MAb (prepared from mice immunized with a synthetic
V3 domain peptide as described in Rini et al.; Proc.
Natl. Acad. Sci. USA 90:6325-9 (1993)). Both MAbs are
known to exhibit potent virus neutralizing activity.
When binding to the recombinant proteins was examined,
the MAb binding to MN-rgp120 was at least 10-fold
greater than to any of the breakthrough virus envelope
proteins (Figure 5 B and C). Surprisingly, rgp120 from
the three patient isolates (C8, C6, and C17) that
possessed the MN serotype-defining sequence, IGPGRAF
(SEQ. ID. NO. 39), varied from one another in their MAb
binding activity. Thus, the binding of MAb 1034 and
MAb 50.1 to rgp120 from C17 was significantly greater
than the binding to rgp120s from C6 and C8.

A distinction in the epitopes recognized by these
MAbs was evident since C6-rgp120 and C8-rgp120 gave
comparable binding with 50.1, whereas 1034 bound better
to the C6-derived protein than the C8-derived protein.
The poorest MAb reactivity was with rgp120s from C11
and C15. This result was consistent with sequence
analysis demonstrating that these two viruses both
possessed the radical substitution of S for R at
position 18 in the V3 domain. Surprisingly, both of
these MAbs exhibited better than expected binding to
rgp120 from the C7 and C10 viruses. Like MN-rgp120,
both proteins contained the penta-peptide sequence
IGPGR (SEQ. ID. NO. 44) in the V3 loop, but differed
from MN-rgp120 in that V and L replaced A and F at
positions 319 and 320 in gp120 from C10, and W replaced
F at position 320 in gp120 from C7. These results
indicate that R at position 318 is essential for the
binding of these two MAbs, and that the epitopes
recognized by 1034 and 50.1 are not completely
destroyed by the hydrophobic substitutions at
positions 319 and 320.

As predicted from the sequence data, there was
little if any binding to the breakthrough virus rgp120s
using MAbs (1088 and 1025) directed to the V2 region of
MN-rgp120. Also consistent with sequence data was the
observation that MAb 1024, directed to the C4 domain of
MN-rgp120, gave some reactivity with C17-rgp120 which,
like MN-rgp120, contained K at position 429, but gave no
reactivity with the other isolates that contained E at
residue 429.

Together, these studies demonstrated that the
antigenic structure of all seven breakthrough viruses
differed from the vaccine immunogen at three well
characterized neutralizing epitopes.

A totally different pattern of reactivity was
observed with the human hybridoma MAb 15e, prepared
from an HIV-1 infected individual as described in Ho et
al.; J. Virol. 65:489-93 (1991). With this MAb, the
greatest binding was achieved with MN-rgp120 and rgp120
from C7, and the poorest reactivity was seen with the
two clones of rgp120 from C11. Moderate but
comparable reactivity was seen with rgp120s from
C10 and C17.

These results demonstrate that the 15e epitope is
polymorphic, and that the epitope is conserved on
MN-rgp120 and rgp120 from C7, but has been lost on
rgp120s from C11. Interestingly, the two different
clones of gp120 derived from C6 gave strikingly
different patterns of antibody binding. Thus, rgp120
from clone C6.5 exhibited strong reactivity with this
antibody, whereas rgp120 from clone C6.1 exhibited
significantly weaker reactivity with this MAb.
Comparison of sequence data (Figure 3) showed that the
two C6 clones differed at 6 amino acid positions.
Based on comparative binding to the other viral
proteins of known sequence, it appeared that the
substitution of K for I at position 351 in the C3
domain of gp120 could account for the difference in
binding activity. This result is also consistent with
both clones of C11 similarly containing a positively
charged K at this position, and also being poorly
reactive with this MAb. Alternatively, a T for I
substitution at position 439 in the C4 domain could
account for the difference in 15e binding between C6.1
and C6.5. Although the inability of the two C11 clones
to bind 15e cannot be explained by polymorphism at this
position in the C4 domain, they could be affected by
the adjacent T for M substitution at position 434.

Discussion

In these studies, the viruses and immune responses
in seven of nine vaccinees who became infected with
HIV-1 through high risk activity while participating in
Phase 1 or Phase 2 trials of MN-rgp120, a candidate
HIV-1 vaccine, were analyzed. Such infections would be
expected to occur for one of two reasons: 1) lack of
sufficient immune response at the time of infection; or
2) infection with viruses that fall outside of the
antigenic spectrum expected to be covered by the
vaccine immunogen. The data indicate that both
explanations may be involved with the infections
observed (Table 8).
TABLE 8

Summary of Breakthrough Infections

                                       Homologous to MN-rgp120
Case No.  Adequate      Homology to    V3       C4       V2
          Immunization  MN-rgp120 (%)  Epitope  Epitope  Epitope

C6                      79
C8                      78             +
C15                     72
C7                      70
C11       +             75
C10       +             69
C17       +             80             +        +


Two of the infections occurred in individuals who
failed to receive the minimum three doses of vaccine
typically required for the induction of protective
immunity with protein subunit vaccines (e.g., hepatitis
B virus formulated in alum adjuvant as described in
Francis et al.; Ann. Int. Med. 97:362-6 (1982)). Two
additional breakthrough infections occurred in
vaccinees who had weak or undetectable primary (C7) and
booster (C15) responses. Of the three individuals who
became infected with HIV-1 after receiving three or
more productive immunizations (C10, C11, and C17), at
least two, and possibly all three, appear to have
become infected more than six months after receiving
their last immunization. Because antibody titers to
MN-rgp120 typically decay with a half-life of 2 to 2.5
months [Belshe et al.; JAMA 272(6):475-80 (1994);
Berman et al.; AIDS 8:591-601 (1994)], antibody titers
would be expected to have decayed at least eight-fold
and possibly as much as sixty-four-fold at the time of
infection. Thus, the lack of a sufficient immune
response at the time of infection represents a
potential explanation for at least six of the seven
breakthrough infections.
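
The fold-decay figures quoted above follow from simple half-life arithmetic. The sketch below assumes first-order (exponential) decay, which is a simplification and not a model stated in the text, and the function name is hypothetical. Under that assumption an eight-fold decline corresponds to three half-lives (six months at a two-month half-life) and a sixty-four-fold decline to six half-lives (for instance, twelve months at a two-month half-life).

```python
def fold_decay(months_elapsed: float, half_life_months: float) -> float:
    """Fold reduction in titer after a given time, assuming exponential decay."""
    if half_life_months <= 0:
        raise ValueError("half-life must be positive")
    if months_elapsed < 0:
        raise ValueError("elapsed time cannot be negative")
    # Each half-life halves the titer, so the fold reduction is 2^(t / t_half).
    return 2.0 ** (months_elapsed / half_life_months)


if __name__ == "__main__":
    # Half-lives of 2 and 2.5 months are the range cited in the text;
    # the elapsed times are illustrative choices.
    for months in (6, 12):
        for half_life in (2.0, 2.5):
            fold = fold_decay(months, half_life)
            print(f"{months} months, half-life {half_life}: {fold:.1f}-fold")
```

The longer 2.5-month half-life gives a smaller reduction over the same interval, so the quoted range depends on both the half-life and the time since the last immunization.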

Data from vaccine efficacy studies in gp160
immunized chimpanzees [McElrath et al.; Longitudinal
Vaccine-Induced Immunity and Risk Behavior of Study
Participants in AVEG Phase II Protocol 201. In:
Abstracts from Eighth Annual Meeting of the National
Cooperative Vaccine Development Groups for AIDS.
Bethesda, MD 1996:216] challenged with HIV-1, and
gp120-immunized rhesus macaques challenged with a
chimeric SIV/HIV-1 virus (SHIV), suggest that the
magnitude of the neutralizing antibody response at the
time of infection is a critical correlate of protective
immunity. If maintaining neutralizing antibody titers
proves to be a valid correlate of protective immunity
in humans, then formulations (e.g., novel adjuvants) or
immunization regimes (frequent boosting) designed to
maximize the antibody responses may be required to
achieve long lasting protection. Use of a booster
every six months may be advantageous.

The other likely explanation for the late
infections is the antigenic difference between the
vaccine and the breakthrough virus envelope
glycoproteins. This explanation is supported by the
observation that four of the seven breakthrough viruses
possessed envelope glycoproteins that differed from
MN-rgp120 by 25-30% at the amino acid level.
Differences of this magnitude have historically
[Myers et al.; Retroviruses and AIDS Database, Los
Alamos National Laboratory (1992 and 1995)] been
associated with inter-subtype variation and far exceed
the average 10-20% variation expected for viruses
within the same subtype.
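
A percentage difference at the amino acid level, such as the 25-30% figure above, can be computed from a pairwise alignment. The following is a minimal sketch, not the procedure used in these studies. It assumes the two sequences are already aligned to equal length with "-" marking gaps, that a gap opposite a residue counts as a difference, and that columns gapped in both sequences are ignored. The function name and the short example strings are hypothetical.

```python
def percent_difference(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared alignment columns at which two sequences differ."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap and res_b == gap:
            continue  # column carries no information for this pair
        compared += 1
        if res_a != res_b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared


if __name__ == "__main__":
    # Short illustrative strings; full-length gp120 alignments would be used
    # in practice.
    reference = "IGPGRAF"
    variant = "IGPGRVL"
    print(f"{percent_difference(reference, variant):.1f}% different")
```

How gaps are scored changes the result noticeably for the variable domains, where insertions and deletions are common, so the gap convention should be stated alongside any reported percentage.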
Although the biologic significance of sequence
variation in many regions of the envelope glycoprotein
is unclear, polymorphism at neutralizing epitopes is an
important factor that affects vaccine efficacy.
Previous studies [Salmon-Ceron et al.; AIDS Res. and
Human Retroviruses 11:1479-86 (1995); Javaherian et
al.; Science 250:1590-3 (1990)] have demonstrated that
the breadth of neutralizing activity that could be
elicited by HIV-1 envelope derived vaccines was
critically dependent on the sequence of epitopes in the
V3 domain (e.g., the PND). Thus, candidate vaccines
based on the LAI strain of HIV-1 (the prototypic
"non-MN-like" subtype B virus) exhibited little or no
cross neutralizing activity with subtype B viruses,
whereas vaccines that contained the "MN-like" PND
sequence (IGPGRAF) (SEQ. ID. NO. 44) exhibited broad
cross neutralizing activity. That four of the seven
breakthrough viruses possessed envelope glycoproteins
with radical amino acid substitutions in the PND is
consistent with the explanation that differences in
antigenic structure explain some of these infections.

Over the last few years, it has become clear that
polymorphism among "MN-like" viruses occurs at
neutralizing epitopes outside of the PND. The best
example occurs in the C4 domain, where two antigenically
distinct variants are distinguished by the presence of
either K or E at position 429 [Moore et al.; AIDS
3:155-63 (1989)]. Because six of the seven
breakthrough viruses differed from the vaccine strain
in that they contained E rather than K at position 429,
antibodies raised to the C4 domain of MN-rgp120 were
unlikely to neutralize the viruses infecting six of
the seven vaccinees.

Other neutralizing epitopes are known to be
present in the V1 and V2 domains of gp120. Although
these regions are highly variable due to insertions
and deletions, neutralizing epitopes have been
described by McKeating et al.; J. Virol. 67:4932-44
(1993); Moore et al.; J. Virol. 67:6136-6151 (1993);
and Davis et al.; J. Gen. Virol. 74:2609-17 (1993).
Several of these epitopes overlap an amino terminal
sequence of the V2 domain containing the tri-peptide
sequence RDK at positions corresponding to 142 to 144
of MN-rgp120 [McKeating et al.; J. Virol. 67:4932-44
(1993); Moore et al.; J. Virol. 67:6136-6151 (1993)].
Like the C4 epitope, variation in this sequence is
known to occur between different substrains derived
from the same parental isolate. Since all seven
breakthrough viruses differed from MN-rgp120 in that
they possessed the RDK sequence, rather than the GDK
sequence present in the vaccine antigen, neutralizing
antibodies to the V2 domain of MN-rgp120 would not have
been expected to neutralize any of the viruses recovered
from the vaccinees immunized with MN-rgp120.

Although polymorphisms at neutralizing epitopes
might account for the lack of protection in most of the
infections, this does not appear to explain the
infection of vaccinee C17, who was infected by a virus
that matched MN-rgp120 in the V3 and C4 domains. If a
difference in sequence was responsible for the lack of
protection in this case, the critical difference might
relate to the unusual sequence in the V1 domain of
gp120 from this breakthrough virus. Several studies
have shown that the V1 domain possesses epitopes
recognized by virus neutralizing monoclonal antibodies
[McKeating et al.; J. Virol. 67:4932-44 (1993);
Davis et al.; J. Gen. Virol. 74:2609-17 (1993);
Kayman et al.; J. Virol. 68:400-410 (1994)].

Although far less is known about the V1 epitopes
relative to other neutralizing sites, the V1 epitopes
appear to be conformation-dependent, and antisera from
HIV-1 infected individuals recognize epitopes in the V1
and V2 domains [McKeating et al.; J. Virol. 67:4932-44
(1993); Kayman et al.; J. Virol. 68:400-410 (1994)].
The V1 sequence of the virus from C17 is noteworthy
because it is smaller and contains fewer N-linked
glycosylation sites than that of MN-rgp120 or any of
the other breakthrough viruses. By the same token, the
envelope glycoproteins from C11 and C6 are noteworthy
because they are significantly larger and contain more
glycosylation sites than MN-rgp120 or the other
breakthrough viruses.

While differences in amino acid sequence can
provide clues to differences in antigenic structure,
the consequences of such polymorphism can only be
proven through antibody binding studies. To correlate
differences in sequence with differences in antigenic
structure, gp120 from two clones each of all seven
breakthrough viruses was expressed and the antigenicity
of the clones with a panel of monoclonal antibodies was
examined. As predicted from the sequence data, none of
the breakthrough virus envelope glycoproteins reacted
with neutralizing MAbs to the V2 domain of MN-rgp120.
When MAbs to the C4 domain were examined, only the C17
envelope glycoprotein (that matched MN-rgp120 with
respect to K429) showed significant, albeit lower,
binding. Surprisingly, the three breakthrough envelope
glycoproteins that contained the subtype B PND
consensus sequence, IGPGRAF (SEQ. ID. NO. !!), gave
poor reactivity with all three PND directed MAbs, even
though they possessed PND sequences closely related to
the vaccine immunogen. Thus, all three of the vaccinee
isolates appeared to possess changes outside of the
recognition site that interfered with MAb binding.

It has been known for many years that resistance
to neutralization in vitro can sometimes be attributed
to mutations in remote sequences that alter the
conformation of neutralizing epitopes and interfere
with recognition by virus neutralizing antibodies
[Nara et al.; J. Virol. 64:3779-91 (1990); Cordonnier
et al.; Nature 340:571-4 (1989)]. Together, these
results indicate that the antigenic structure of the
envelope glycoproteins recovered from the breakthrough
viruses differed significantly from that of the vaccine
antigen.

A novel result was the localization of residues in
the C3 domain that appeared to affect the binding of
the virus neutralizing human MAb, 15e. This MAb is
known to recognize a discontinuous epitope, block CD4
binding, and neutralize a variety of laboratory and
primary isolates of HIV-1 [Ho et al.; J. Virol.
65:489-93 (1991); Thali et al.; J. Virol. 66:5635-5641
(1992); Moore et al.; AIDS Res. Hum. Retroviruses
9:1179-1187 (1993)].

Comparative binding to envelope glycoproteins from
the breakthrough viruses indicated that recognition by
this antibody is critically dependent on residues in
the C3 or C4 domains of gp120. The unique occurrence
of a positively charged K at position 351 in the C3
domain provides a common explanation for the inability
of the C11.5, C11.7 and C6.1 strains of HIV-1 to bind
to 15e. Alternatively, it is possible that different
amino acid substitutions in different locations account
for the failure of 15e to bind to rgp120s from the C6
and C11 clones. The only obvious positions where
substitutions of this type occur are in the C4 domain,
where T replaces M at 434 (C11) and T replaces I at
439.

The present studies demonstrate that the current
formulation of MN-rgp120 is less than 100% effective
against HIV-1 infection. Based on previous in vitro
and in vivo studies with MN-rgp120, protection from
natural HIV-1 infection in humans is expected to depend
on a threshold concentration of virus-neutralizing
antibodies, and antigenic similarity between the
vaccine immunogen and the challenge virus.
In this regard, only one of the seven breakthrough
infections (C17) was unexpected. This individual
received a full course of immunizations yet became
infected with a virus similar to MN-rgp120 in at least
two important neutralizing epitopes (V3 and C4
domains). This infection might be related to the
magnitude of the antibody response at the time of
infection, or antigenic differences between the
breakthrough virus and the vaccine strain, or
circumstances of infection (e.g., ulcerative lesions,
infection by a donor with acute infection or high
viremia) not monitored in this protocol.
Alternatively, this individual may represent a true
vaccine failure, without clear explanation.

On balance, the analysis of breakthrough
infections described herein did not uncover any data
that would discourage the continued development of
MN-rgp120 as a vaccine to prevent HIV-1 infection. The
results support speculation that enhancing vaccine
immunogenicity (as by additional booster immunizations)
may be required to maintain long term protective
immunity, and that the addition of rgp120 from other
antigenically different strains of virus to
MN-rgp120 is useful to expand the breadth of
protection.

The availability of viruses and viral
glycoproteins derived from breakthrough infections may
provide an important means to streamline the process of
identifying new antigens for inclusion into a
multivalent vaccine. Recombinant viral glycoproteins
prepared from breakthrough viruses, by definition,
possess antigenic structures that are significantly
different from MN-rgp120, and are representative of
viruses currently being transmitted. Thus, combining
rgp120 from breakthrough viruses with MN-rgp120 is an
effective way to complement and significantly expand
antigenic complexity and increase breadth of cross
neutralizing activity.
(B) STREET: 25 Metro Drive, Suite 700
(viii) ATTORNEY/AGENT INFORMATION:
(2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1503 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GGG GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA  36
Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1               5                   10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG  75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
        15                  20                  25

CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC  114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
                30                  35

CCA AAC CCA CAA GAA ATG GTA TTG GAA AAT GTG ACA GAA  153
Pro Asn Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu
    40                  45                  50

GAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG  192
Asp Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met
            55                  60

CAT GAG GAT ATA ATC ACT TTA TGG GAT CAA AGC CTA AAA  231
His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys
65                  70                  75

CCA TGT GTA AAA TTA ACC CCA CTC TGT ATT ACT TTA AAT  270
Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
        80                  85                  90

TGC ACC AAT TGG AAG AAG AAT GAT ACT AAA ACT AAT AGT  309
Cys Thr Asn Trp Lys Lys Asn Asp Thr Lys Thr Asn Ser
                95                  100

AGT AGT ACT ACA ACT AAT AAT AGT AGT GCT ACA GCT AAT  348
Ser Ser Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn
    105                 110                 115

AGT AGT AGT ACT ACA ACT AAT AGT AGT TGG GCA GAG ATA  387
Ser Ser Ser Thr Thr Thr Asn Ser Ser Trp Gly Glu Ile
            120                 125

AAG GAG GGA GAA ATA AAG AAC TGC TCT TTC AAT ATC ACC  426
Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
130                 135                 140

ACA AGC ATA AGA GAC AAG GTG AAG AAA GAA TAT GCA CTT  465
Thr Ser Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala Leu
        145                 150                 155

TTT TAT AGC CTT GAT GTA GTA CCA ATA GAA AAT CAT AAT  504
Phe Tyr Ser Leu Asp Val Val Pro Ile Glu Asn Asp Asn
                160                 165

ACT AGC TAT AGG TTG AGA AGT TGT AAC ACC TCA CTC ATT  543
Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val Ile
    170                 175                 180

ACA CAA GCC TGT CCA AAG GTA ACT TTT GAG CCA ATT CCC  582
Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro
            185                 190

ATA CAT TAT TGT ACC CCG GCT GGT TTT GCG ATT CTG AAG  621
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195                 200                 205

TGT AGA GAT AAA AAG TTC AAT GGA ACA GCA CCA TCC AAA  660
Cys Arg Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
        210                 215                 220

AAT GTT AGC ACA GTA CAA TGT GCA CAT GGA ATT AAG CCA  699
Asn Val Ser Thr Val Gln Cys Ala His Gly Ile Lys Pro
                225                 230

GTA GTG TCA ACT CAA CTG CTG TTA AAT GGC AGC CTA GCA  738
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
    235                 240                 245

GAA GAA GAG GTA ATA ATT AGA TCT GCC AAT TTC TCA AAC  777
Glu Glu Glu Val Ile Ile Arg Ser Ala Asn Phe Ser Asn
            250                 255

AAT GCT AAA ATC ATA ATA GTA CAG TTG AGG GAA CCT GTA  816
Asn Ala Lys Ile Ile Ile Val Gln Leu Arg Glu Pro Val
260                 265                 270

GAA ATT AAT TGT ACA AGA CCC AGC AAC AAT ACA ATA AAA  855
Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
        275                 280                 285

GGT ATA CAC ATA GGA CCA GGG AGA GCA TTT TAT GCA ACA  894
Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr
                290                 295

GGA GAC ATA CGA GGA GAT ATA AGA CAA GCA CAT TGT AAC  933
Gly Asp Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn
    300                 305                 310

ATT AGT GGA GCA AAA TGG AAT AAC ACT TTA AAG AAG GTA  972
Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu Lys Lys Val
            315                 320

GTT AAA AAA TTA AAA GAA CAA TTT CCA AAT AAA ACA ATA  1011
Val Lys Lys Leu Lys Glu Gln Phe Pro Asn Lys Thr Ile
325                 330                 335

GTC TTT AAC CAT TCC TCA GGA GGG GAC CCA GAA ATT GTA  1050
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val
        340                 345                 350

ATG CAC AGT TTT AAT TGT CAA GGG GAA TTT TTC TAC TGT  1089
Met His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys
                355                 360

AAT ACA ACA AAG CTG TTT AAT AGT ACT TGG AAT GAT ACT  1128
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp Thr
    365                 370                 375

ACA GAG TCA AAT AAC AAT GAT AGT ACT ATT ACA CTC CCA  1167
Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro
            380                 385

TGC AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA ATA  1206
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Ile
390                 395                 400

GGA AAA GCA ATG TAT GCC CCT CCC ACC AGA GGA GAA ATT  1245
Gly Lys Ala Met Tyr Ala Pro Pro Thr Arg Gly Glu Ile
        405                 410                 415

AAA TGT TCA TCA AAT ATT ACA GGA CTA CTG TTA ATA AGA  1284
Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Ile Arg
                420                 425

GAT GGT GGT ATT AAC ACT AGC GAT GCC ACC GAG ACC TTC  1323
Asp Gly Gly Ile Asn Thr Ser Asp Ala Thr Glu Thr Phe
    430                 435                 440

AGA CCG GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT  1362
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
            445                 450

GAA TTA TAT AAA TAT AAA GTA GTC AAA ATT GAG CCA TTA  1401
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
455                 460                 465

GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG CTG CAG  1440
Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
        470                 475                 480

AGA GAA AAA AGA GCA GTA ACA CTA GGA GCT ATG TTC CTT  1479
Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu
                485                 490

GGG TTC TTA GGA GCA TAA AGC TTC  1503
Gly Phe Leu Gly Ala Xaa Ser Phe
    495                 500 501

(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 501 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala
1               5                   10                  15

Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala
                20                  25                  30

Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met Val
                35                  40                  45

Leu Glu Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn Asp Met
                50                  55                  60

Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser
                65                  70                  75

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
                80                  85                  90

Cys Thr Asn Trp Lys Lys Asn Asp Thr Lys Thr Asn Ser Ser Ser
                95                  100                 105

Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn Ser Ser Ser Thr
                110                 115                 120

Thr Thr Asn Ser Ser Trp Gly Glu Ile Lys Glu Gly Glu Ile Lys
                125                 130                 135

Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Lys
                140                 145                 150

Lys Glu Tyr Ala Leu Phe Tyr Ser Leu Asp Val Val Pro Ile Glu
                155                 160                 165

Asn Asp Asn Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val
                170                 175                 180

Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile
                185                 190                 195

His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp
                200                 205                 210

Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
                215                 220                 225

Gln Cys Ala His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
                230                 235                 240

Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg Ser Ala
                245                 250                 255

Asn Phe Ser Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Arg Glu
                260                 265                 270

Pro Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
                275                 280                 285

Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp
                290                 295                 300

Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly Ala
                305                 310                 315

Lys Trp Asn Asn Thr Leu Lys Lys Val Val Lys Lys Leu Lys Glu
                320                 325                 330

Gln Phe Pro Asn Lys Thr Ile Val Phe Asn His Ser Ser Gly Gly
                335                 340                 345

Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gln Gly Glu Phe
                350                 355                 360

Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp
                365                 370                 375

Thr Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro Cys
                380                 385                 390

Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Ile Gly Lys Ala
                395                 400                 405

Met Tyr Ala Pro Pro Thr Arg Gly Glu Ile Lys Cys Ser Ser Asn
                410                 415                 420

Ile Thr Gly Leu Leu Leu Ile Arg Asp Gly Gly Ile Asn Thr Ser
                425                 430                 435

Asp Ala Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp
                440                 445                 450

Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
                455                 460                 465

Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
                470                 475                 480

Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu Gly Phe
                485                 490                 495

Leu Gly Ala Xaa Ser Phe
                500 501



(2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1503 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GGG GTA CCT GTA TGG AAA GAA GCA ACC ACC ACT CTA 36
Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1 5 10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG 75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
15 20 25

CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC 114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
30 35

CCA AAC CCA CAA GAA ATG GTA TTG GAA AAT GTC ACA GAA 153
Pro Asn Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu
40 45 50

GAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met
55 60

CAT GAG ANT ATA ATC AGT TTA TGG GAT CAA AGC CTA AAA 231
His Glu Xaa Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys
65 70 75

CCA TGT GTA AAA TTA ACC CCA CTC TGT ATT ACT TTA AAT 270
Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
80 85 90

TGC ACC AAT TGG AAG GAG AAT GAT ACT AAA ACT AAT AGT 309
Cys Thr Asn Trp Lys Glu Asn Asp Thr Lys Thr Asn Ser
95 100

AGT AGT ACT ACA ACT AAT AAT AGT AGT GCT ACA GCT AAT 348
Ser Ser Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn
105 110 115

AGT AGT AGT ACT ACA ACT AAT AGT AGT TGG CGA GAC ATA 387
Ser Ser Ser Thr Thr Thr Asn Ser Ser Trp Gly Glu Ile
120 125

AAG GAG GGA GAA ATA AAG AAC TGC TCT TTC AAT ATC ACC 426
Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
130 135 140

ACA GGC ATA AGA GAC AAG GTG AAG AAA GAA TAT CCA CTT 465
Thr Gly Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala Leu
145 150 155

TTT TAT AGC CTT GAT GTA GTA CCA ATA GAA AAT GAT AAT 504
Phe Tyr Ser Leu Asp Val Val Pro Ile Glu Asn Asp Asn
160 165

ACT AGC TAT AGG TTG AGA AGT TGT AAC ACC TCA GTC ATT 543
Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val Ile
170 175 180



ACA CAA GCC TGT CCA AAG GTA ACT TTT GAG CCA ATT CCC 582
Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro
185 190

ATA CAT TAT TGT ACC CCG GCT GGT TTT GCG ATT CTG AAG 621
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195 200 205

TGT AAA GAT AAA AAG TTC AAT GGA ACA GGA CCA TGC AAA 660
Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
210 215 220

AAT GTT AGC ACA GTA CAA TGT ACA CAT GGA ATT AAC CCA 699
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro
225 230

GTA GTG TCA ACT CAA CTG CTG TTA AAT GGC AGC CTA GCA 738
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
235 240 245

GAA GAA GAG GTA ATA ATT AGA TCT GCC AAT TTC TCA AAC 777
Glu Glu Glu Val Ile Ile Arg Ser Ala Asn Phe Ser Asn
250 255

AAT GCT AAA ATC ATA ATA GTA CAG TTG AAG GAA CCT GTA 816
Asn Ala Lys Ile Ile Ile Val Gln Leu Lys Glu Pro Val
260 265 270

GAA ATT AAT TGT ACA AGA CCC AGC AAC AAT ACA ATA AAA 855
Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
275 280 285

GGT ATA CAC ATA GCA CCA GGG AGA GCA TTT TAT GCA ACA 894
Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr
290 295



GGA GAC ATA CGA GGA GAT ATA AGA CAA GCA CAT TGT AAC 933
Gly Asp Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn
300 305 310

ATT AGT GGA GCA AAA TGG AAT AAC ACT TTA AAG AAG GTA 972
Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu Lys Lys Val
315 320

GTT ATA AAA TTA AAA GAA CAA TTT CCA AAT AAA ACA ATA 1011
Val Ile Lys Leu Lys Glu Gln Phe Pro Asn Lys Thr Ile
325 330 335

GTC TTT AAC CAT TCC TCA GGA GGG GAC CCA GAA ATT GTA 1050
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350

ATG CAC AGT TTT AAT TGT CAA GGG GAA TTT TTC TAC TGT 1089
Met His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys
355 360

AAT ACA ACG AAG CTG TTT AAT AGT ACT TGG AAT GAT ACT 1128
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp Thr
365 370 375

ACA GAG TCA AAT AAC AAT GAT AGT ACT ATT ACA CTC CCA 1167
Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro
380 385

TGC AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA 1206
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val
390 395 400

GGA AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA GAA ATT 1245
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile
405 410 415

AAA TGT TCA TCA AAT ATT ACA GGA CTA CTG TTA ACA AGA 1284
Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
420 425

GAT GGT GGT ATT AAC ACT AGC GAT GCC ACC GAG ACC TTC 1323
Asp Gly Gly Ile Asn Thr Ser Asp Ala Thr Glu Thr Phe
430 435 440

AGA CCG GGA GGA GGA CAT ATG AGG GAC AAT TGG AGA AGT 1362
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
445 450

GAA TTA TAT AAA TAT AAA GTA GTG AAA ATT GAG CCA TTA 1401
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
455 460 465

GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG 1440
Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
470 475

AGA GAA AAA AGA GCA GTA ACA CTA GGA GCT ATG TTC CTT 1479
Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu
485 490

GGG TTC TTG GGA GCA TAA AGC TTC 1503
Gly Phe Leu Gly Ala Xaa Ser Phe
495 500 501



(2) INFORMATION FOR SEQ ID NO:4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:



Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala
1 5 10 15

Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala
20 25 30

Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met Val
35 40 45

Leu Glu Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn Asp Met
50 55 60

Val Glu Gln Met His Glu Xaa Ile Ile Ser Leu Trp Asp Gln Ser
65 70 75

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
80 85 90

Cys Thr Asn Trp Lys Glu Asn Asp Thr Lys Thr Asn Ser Ser Ser
95 100 105

Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn Ser Ser Ser Thr
110 115 120

Thr Thr Asn Ser Ser Trp Gly Glu Ile Lys Glu Gly Glu Ile Lys
125 130 135

Asn Cys Ser Phe Asn Ile Thr Thr Gly Ile Arg Asp Lys Val Lys
140 145 150

Lys Glu Tyr Ala Leu Phe Tyr Ser Leu Asp Val Val Pro Ile Glu
155 160 165

Asn Asp Asn Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val
170 175 180

Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile
185 190 195

His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp
200 205 210

Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
215 220 225

Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
230 235 240

Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg Ser Ala
245 250 255



Asn Phe Ser Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Lys Glu
260 265 270

Pro Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
275 280 285

Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp
290 295 300

Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly Ala
305 310 315

Lys Trp Asn Asn Thr Leu Lys Lys Val Val Ile Lys Leu Lys Glu
320 325 330

Gln Phe Pro Asn Lys Thr Ile Val Phe Asn His Ser Ser Gly Gly
335 340 345

Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gln Gly Glu Phe
350 355 360

Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp
365 370 375

Thr Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro Cys
380 385 390

Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
395 400 405

Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Lys Cys Ser Ser Asn
410 415 420

Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ile Asn Thr Ser
425 430 435

Asp Ala Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp
440 445 450

Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
455 460 465

Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
470 475 480

Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu Gly Phe
485 490 495

Leu Gly Ala Xaa Ser Phe
500 501



(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1461 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

G GTA CCT GTA TCG AAA GAA GCA ACC ACC ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AAA GCA TAT GAT ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCT ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AAC CCA CAA GAA GTA GTA TTG GAA AAT CTA ACA GAA AAT 154
Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AGT TTA TGG GAT CAA ACT CTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

ACT AAT TTG GAG AAT GCT AAT AAT ACC GAG AAT GCT AAT 310
Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn
95 100



AAT ACC AAT AAT TAT ACC TTG GGG ATG GAG AGA GCT GAA 349
Asn Thr Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu
105 110 115

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA ACC TTA ACA 388
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Leu Arg
120 125

GAT AAG GTG AAA AAA GAA TAT GCA TTG TTT TAT AAA CTT 427
Asp Lys Val Lys Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
130 135 140

GAT GTA GTA CAA ATA GAT AAT AGT ACC AAC TAT AGG CTG 466
Asp Val Val Gln Ile Asp Asn Ser Thr Asn Tyr Arg Leu
145 150 155

ATA AGT TGT AAT ACC TCA GTC ATT ACA CAG GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165

AAG GTA TCC TTT GAG CTA ATT CCC ATA CAT TAT TGT GCC 544
Lys Val Ser Phe Glu Leu Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys
185 190

TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGC ACA GTA 622
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
195 200 205

CAA TGT ACA CAT GGA ATT AGA CCA GTA GTA TCA ACT CAA 661
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220

CTA CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG ATA GTA 700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val
225 230

ATT AGA TCT GAA AAT ATC ACA GAC AAT GCT AAA ACC ATA 739
Ile Arg Ser Glu Asn Ile Thr Asp Asn Ala Lys Thr Ile
235 240 245

ATA GTG CAG CTA AAT GAA TCT ATA GTG ATT AAT TGT ACA 778
Ile Val Gln Leu Asn Glu Ser Ile Val Ile Asn Cys Thr
250 255



AGA CCC AAT AAC AAC ACA AGA AAA AGT ATA AAT ATA GGA 817
Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn Ile Gly
260 265 270

CCA GGG AGA GCA TTC TAT ACA ACA GGA GAC ATA ATA GGA 856
Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

GAT ATA AGA CAA GCA CAT TGT AAC CTT AGT AAA ACA CAA 895
Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln
290 295

TGG GAA AAA ACG TTA AGA CAG ATA GCT ATA AAA TTA GAA 934
Trp Glu Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu
300 305 310

GAA AAA TTT AAG AAT AAA ACA ATA GCC TTT AAT AAA TCC 973
Glu Lys Phe Lys Asn Lys Thr Ile Ala Phe Asn Lys Ser
315 320

TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT AAT 1012
Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn
325 330 335

TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA AAA CTG 1051
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Lys Leu
340 345 350

TTT AAT AGT ACC TGG AAT TTA ACA CAA CCG TTT AGT AAT 1090
Phe Asn Ser Thr Trp Asn Leu Thr Gln Pro Phe Ser Asn
355 360

ACC GGG AAT CGT ACT GAA GAG TTA AAT ATT ACA CTC CCA 1129
Thr Gly Asn Arg Thr Glu Glu Leu Asn Ile Thr Leu Pro
365 370 375

TGC AGA ATA AAA CAA ATC ATA AAC TTG TGG CAG GAA GTA 1168
Cys Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val
380 385

GGC AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA CAA ATT 1207
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile
390 395 400

AGA TGT TCA TCA AAT ATT ACA GGG CTA CTA TTA ACA AGA 1246
Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
405 410 415

GAT GGT GGA AGT AAC ACC GGT GAC AAC AGG ACT GAG ACC 1285
Asp Gly Gly Ser Asn Thr Gly Asp Asn Arg Thr Glu Thr
420 425

TTT AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA 1324
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
430 435 440

AGT GAA TTA TAT AAA TAT AAA GTA GTA AGA ATT GAA CCA 1363
Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro
445 450

TTA GGA GTA GCA CCC ACC CAG GCA AAG AGA AGA GTG GTG 1402
Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg Val Val
455 460 465

CAA AGA GAA AAA AGA GCA GTG GGG ATA GGA CCT ATG TTC 1441
Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

CTT GGG TTC TTG GGA GAT AA 1461
Leu Gly Phe Leu Gly Asp
485 486

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 486 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:



Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser
1 5 10 15

Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr
20 25 30

His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu
35 40 45

Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
50 55 60

Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn Asn Thr
95 100 105

Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu Ile Lys Asn Cys
110 115 120

Ser Phe Asn Ile Thr Thr Ser Leu Arg Asp Lys Val Lys Lys Glu
125 130 135

Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Gln Ile Asp Asn Ser
140 145 150

Thr Asn Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
155 160 165

Ala Cys Pro Lys Val Ser Phe Glu Leu Ile Pro Ile His Tyr Cys
170 175 180

Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe
185 190 195

Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr
200 205 210



His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
215 220 225

Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asn Ile Thr
230 235 240

Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Ile Val
245 250 255

Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn
260 265 270

Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln Trp Glu
290 295 300

Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu Glu Lys Phe Lys
305 310 315

Asn Lys Thr Ile Ala Phe Asn Lys Ser Ser Gly Gly Asp Pro Glu
320 325 330

Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
335 340 345

Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Leu Thr Gln Pro
350 355 360

Phe Ser Asn Thr Gly Asn Arg Thr Glu Glu Leu Asn Ile Thr Leu
365 370 375

Pro Cys Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val Gly
380 385 390

Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser
395 400 405

Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser Asn
410 415 420

Thr Gly Asp Asn Arg Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp
425 430 435

Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
440 445 450

Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg
455 460 465

Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

Leu Gly Phe Leu Gly Asp
485 486

(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1474 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

G GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AAA GCA TAT GAT ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCT ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AAC CCA CAA GAA GTA GTA TTG GAA AAT GTA ACA GAA AAT 154
Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AGT TTA TGG GAT CAA ACT CTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

ACT AAT TTG GAG AAT GCT AAT AAT ACC GAG AAT GCT AAT 310
Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn
95 100

AAT ACC AAT AAT TAT ACC TTG GGG ATG GAG AGA GGT GAA 349
Asn Thr Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu
105 110 115



AGA AAA AAC TGC TCT TTC AAT ATC ACC ACA AGC TTA AGA 388
Arg Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Leu Arg
120 125

GAT AAG GGG AAA AAA GAA TAT GCA TTG TTT TAT AAA CTT 427
Asp Lys Gly Lys Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
130 135 140

GAT GTA GTA CAA ATA GAT AAT AGT ACC AAC TAT AGG CTG 466
Asp Val Val Gln Ile Asp Asn Ser Thr Asn Tyr Arg Leu
145 150 155

ATA AGT TGT AAT ACC TCA GTC ATT ACA CAC GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165

AAG GTA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT GCC 544
Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys
185 190

TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGG ACA GTA 622
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Arg Thr Val
195 200 205

CAA TGT ACA CAT GGA ATT AGA CCA GTA GTA TCA ACT CAA 661
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220

CTA CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG ATA GTA 700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val
225 230



ATT AGA TCT GAA AAT ATC ACA GAC AAT GCT AAA ACC ATA 739
Ile Arg Ser Glu Asn Ile Thr Asp Asn Ala Lys Thr Ile
235 240 245

ATA GTG CAG CTA AAT GAA TCT ATA GTG ATT AAT TGT ACA 778
Ile Val Gln Leu Asn Glu Ser Ile Val Ile Asn Cys Thr
250 255

AGA CCC AAT AAC AAC ACA AGA AAA AGT ATA AAT ATA GGA 817
Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn Ile Gly
260 265 270

CCA GGG AGA GCA TTC TAT ACA ACA GGA GAC ATA ATA GGA 856
Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

GAT ATA AGA CAA GCA CAT TGT AAC CTT AGT AAA ACA CAA 895
Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln
290 295

TGG GAA AAA ACG TTA AGA CAG ATA GCT ATA AAA TTA GAA 934
Trp Glu Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu
300 305 310

GAA AAA TTT AAG AAT AAA ACA ATA GCC TTT AAT AAA TCC 973
Glu Lys Phe Lys Asn Lys Thr Ile Ala Phe Asn Lys Ser
315 320

TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT AAT 1012
Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn
325 330 335

TGT GGA GGG GGA TTT TTC TAC TGT AGT ACG AGA AAA CTG 1051
Cys Gly Gly Gly Phe Phe Tyr Cys Ser Thr Arg Lys Leu
340 345 350

TTT AAT AGT ACC TGG AAT TTA ACA CAA CCG TTT AGT AAT 1090
Phe Asn Ser Thr Trp Asn Leu Thr Gln Pro Phe Ser Asn
355 360

ACC GGG GAT CGT ACT GAA GAG TTA AAT ATT ACA CTC CCA 1129
Thr Gly Asp Arg Thr Glu Glu Leu Asn Ile Thr Leu Pro
365 370 375

TGC AGA ATA AAA CAA ATC ATA AAC TTG TGG CAG GAA GTA 1168
Cys Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val
380 385

GGC AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA CAA ATT 1207
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile
390 395 400

AGA TGT TCA TCA AAT ATT ACA GGG CTA CTA TTA AGG AGA 1246
Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Arg Arg
405 410 415

GAT GGT GGA AGT AAC ACC AGT GAC AAC CAG ACT GAG ACC 1285
Asp Gly Gly Ser Asn Thr Ser Asp Asn Gln Thr Glu Thr
420 425

TTT AGA CCT GGG GGA GGA GAT ATG AGG GAC AAG TGG AGA 1324
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Lys Trp Arg
430 435 440

AGT GAA TTA TAT AAA TAT AAA GTA GTA AGA ATT GAA CCA 1363
Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro
445 450

TTA GGA GTA GCA CCC ACC CAG GCA AAG AGA ACA GTG GTG 1402
Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg Val Val
455 460 465

CAA AGA GAA AAA AGA GCA GTG GGG ATA GGA CCT ATC TTC 1441
Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

CTT AGG TTC TTA GGA GAT AAA GCT TCT AGA GTC 1474
Leu Arg Phe Leu Gly Asp Lys Ala Ser Arg Val
485 490 491

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser
1 5 10 15

Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr
20 25 30

His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu
35 40 45

Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
50 55 60

Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn Asn Thr
95 100 105

Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu Arg Lys Asn Cys
110 115 120

Ser Phe Asn Ile Thr Thr Ser Leu Arg Asp Lys Gly Lys Lys Glu
125 130 135

Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Gln Ile Asp Asn Ser
140 145 150

Thr Asn Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
155 160 165

Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
170 175 180

Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe
185 190 195

Asn Gly Thr Gly Pro Cys Lys Asn Val Arg Thr Val Gln Cys Thr
200 205 210

His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
215 220 225

Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asn Ile Thr
230 235 240

Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Ile Val
245 250 255

Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn
260 265 270

Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln Trp Glu
290 295 300

Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu Glu Lys Phe Lys
305 310 315

Asn Lys Thr Ile Ala Phe Asn Lys Ser Ser Gly Gly Asp Pro Glu
320 325 330

Ile Val Met His Ser Phe Asn Cys Gly Gly Gly Phe Phe Tyr Cys
335 340 345

Ser Thr Arg Lys Leu Phe Asn Ser Thr Trp Asn Leu Thr Gln Pro
350 355 360

Phe Ser Asn Thr Gly Asp Arg Thr Glu Glu Leu Asn Ile Thr Leu
365 370 375

Pro Cys Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val Gly
380 385 390

Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser
395 400 405

Ser Asn Ile Thr Gly Leu Leu Leu Arg Arg Asp Gly Gly Ser Asn
410 415 420

Thr Ser Asp Asn Gln Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp
425 430 435

Met Arg Asp Lys Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
440 445 450

Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg
455 460 465

Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

Leu Arg Phe Leu Gly Asp Lys Ala Ser Arg Val
485 490 491

(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1512 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

CTC GAG GTA CCT GTA TGG AAA GAA GCA ACT ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT AAT ACA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu
15 20 25

AAA CAT AAT GTT TGG GCC ACA CAC GCC TGT GTA CCC ACA 114
Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAT CCC AAC CCA CAA GAA GTA GTA TTG GGA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GAA GAT ATA ATC AGT TTA TGG GAT CAA AGT CTA 231
Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT GAT GAT TTA GGG AAT GCT ACT AAT ACC AAT 309
Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn
95 100

AGT AGT GCC ACT ACC AAT AGT AGT AGT TGG GAA GAA ATG 348
Ser Ser Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met
105 110 115

AAG GGG GAA ATG AAA AGA TGC TCT TTC AAT ATC ACC ACA 387
Lys Gly Glu Met Lys Arg Cys Ser Phe Asn Ile Thr Thr
120 125

AGC ATA AGA GAT AAG ATT AAG AAA GAA CAT GCA CTT TTC 426
Ser Ile Arg Asp Lys Ile Lys Lys Glu His Ala Leu Phe
130 135 140

TAT AGA CTT GAT GTA GTA CCA ATA GAT AAT GAT AAT ACC 465
Tyr Arg Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr
145 150 155

ACA TAT AGG TTG ATA AAT TGT AAT ACC TCA GTC ATT ACA 504
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
160 165

CAG GCC TGT CCA AAG GTA TCA TTT GAG CCA ATT CCC ATA 543
Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
170 175 180

CAT TTT TGT GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT 582
His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
185 190

AAT AAT AAG ACG TTC GAG GGA AAA GGA CCA TGT AAA AAT 621
Asn Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn
195 200 205



GTC AGT ACA GTA CAA TGC ACA CAT GGA ATT AGG CCA GTA 660
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220

GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA 699
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230

GAA GAG GTA ATA ATT AGA TCT GAC AAT ATC ACA GAC AAT 738
Glu Glu Val Ile Ile Arg Ser Asp Asn Ile Thr Asp Asn
235 240 245

ACT AAA ACC ATT ATA GTA CAG CTA AAC GAA TCT GTA GTA 777
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
250 255

ATT AAT TGT ACA AGA CCC AAC AAC AAT ACA AGA AAA AGT 816
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
260 265 270

ATA CAT ATA GGA CCA GGG AGT GCA TTT TTT GCA ACA GGA 855
Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

GAA ATA ATA GGA GAT ATA AGA CAA GCA CAC TGT AAC CTT 894
Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu
290 295

AGT AGA ACA CAA TGG AAT AAC ACT TTA GGA AAG ATA GTC 933
Ser Arg Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val
300 305 310

ATA AAA TTA AGA GAA CAA TTT AGA AAA CAA TTT GGA GAA 972
Ile Lys Leu Arg Glu Gln Phe Arg Lys Gln Phe Gly Glu
315 320

AAA ACA ATA GTC TTT AAT CGA TCC TCA GGA GGG GAC CCG 1011
Lys Thr Ile Val Phe Asn Arg Ser Ser Gly Gly Asp Pro
325 330 335



GAA ATT GCA ATG CAC AGT TTT AAT TGT GGA GGG GAA TTT 1050
Glu Ile Ala Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350

TTC TAC TGT AAC ACA ACA GCA CTG TTT AAT ACT ACC TGG 1089
Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp
355 360

AAT GTT ACT AAA GGG TTG AAT AAC ACT GAA GGA AAT AGC 1128
Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn Ser
365 370 375

ACA GGA GAT GAA AAT ATC ATA CTC CCA TGT AGA ATA AAA 1167
Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys
380 385

CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG 1206
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
390 395 400

TAT GCC CCT CCC ATC AGT GGA CAA ATT AGA TGT TCA TCA 1245
Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser
405 410 415

AAC ATT ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT AGT 1284
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser
420 425

AAG AAC GAG AGC ATC ACC ACC GAG GTC TTC AGA CCT GGA 1323
Lys Asn Glu Ser Ile Thr Thr Glu Val Phe Arg Pro Gly
430 435 440

GGA GGA GAT ATG AGG GAC AAT TGG ACA AGT GAA TTA TAT 1362
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
445 450

AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GCA GTA GCG 1401
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465

CCC ACC AAG GCA AAG AGA AGA GTG CTG CAG AGA GAA AAA 1440
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

AGA GCA GTG GGA ACA ATA GGA GCT ATG TTC CTT GGG TTC 1479
Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe
485 490

TTG GGA GCA TAA AGC TTC TAG AGT CGA CCT GCA 1512
Leu Gly Ala Xaa Ser Phe Xaa Ser Arg Pro Ala
495 500 504

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 504 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:



Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
1 5 10 15

Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu Lys His Asn Val Trp
20 25 30

Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val
35 40 45

Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
50 55 60

Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln
65 70 75

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn Ser Ser
95 100 105

Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met Lys Gly Glu Met
110 115 120

Lys Arg Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Ile
125 130 135

Lys Lys Glu His Ala Leu Phe Tyr Arg Leu Asp Val Val Pro Ile
140 145 150

Asp Asn Asp Asn Thr Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser
155 160 165

Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro
170 175 180

Ile His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
185 190 195

Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn Val Ser Thr
200 205 210

Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
215 220 225

Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg Ser
230 235 240

Asp Asn Ile Thr Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Asn
245 250 255

Glu Ser Val Val Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
260 265 270

Lys Ser Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser Arg
290 295 300

Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val Ile Lys Leu Arg
305 310 315

Glu Gln Phe Arg Lys Gln Phe Gly Glu Lys Thr Ile Val Phe Asn
320 325 330

Arg Ser Ser Gly Gly Asp Pro Glu Ile Ala Met His Ser Phe Asn
335 340 345

Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn
350 355 360

Ser Thr Trp Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn
365 370 375

Ser Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys Gln
380 385 390

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
395 400 405

Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu
410 415 420

Leu Leu Thr Arg Asp Gly Gly Ser Lys Asn Glu Ser Ile Thr Thr
425 430 435

Glu Val Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
440 445 450

Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly
455 460 465

Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe Leu Gly
485 490 495

Ala Xaa Ser Phe Xaa Ser Arg Pro Ala
500 504

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1501 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACT ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT AAT ACA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu
15 20 25

AAA CAT AAT GTT TGG GCC ACA CAC GCC TGT GTA CCC ACA 114
Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAT CCC AAC CCA CAA GAA GTA GTA TTG GGA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GAA GAT ATA ATC AGT TTA TGG GAT CAA AGT CTA 231
Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT GAT GAT TTA GGG AAT GCT ACT AAT ACC AAT 309
Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn
95 100

AGC AGT GCC ACT ACC AAT AGT AGT AGT TGG GAA GAA ATG 348
Ser Ser Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met
105 110 115

AAG GGG GAA ATG AAA AGG TGC TCT TTC AAT ATC ACC ACA 387
Lys Gly Glu Met Lys Arg Cys Ser Phe Asn Ile Thr Thr
120 125

AGC ATA AGA GAT AAG ATT AAG AAA GAA CAT GCA CTT TTC 426
Ser Ile Arg Asp Lys Ile Lys Lys Glu His Ala Leu Phe
130 135 140

TAT AGA CTT GAT GTA GTA CCA ATA GAT AAT GAT AAT ACC 465
Tyr Arg Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr
145 150 155

ACA TAT AGG TTG ATA AAT TGT AAT ACC TCA GTC ATT ACA 504
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
160 165

CAG GCC TGT CCA AAG GTA TCA TTT GAG CCA ATT CCC ATA 543
Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
170 175 180

CAT TTT TGT GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT 582
His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
185 190

AAT AAT AAG ACG TTC GAG GGA AAA GGA CCA TGT AAA AAT 621
Asn Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn
195 200 205

GTC AGT ACA GTA CAA TGC ACA CAT GGA ATT AGG CCA GTA 660
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220

GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA 699
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230

GAA GAG GTA ATA ATT AGA TCT GGC AAT ATC ACA GAC AAT 738
Glu Glu Val Ile Ile Arg Ser Gly Asn Ile Thr Asp Asn
235 240 245

ACT AAA ACC ATT ATA GTA CAG CTA AAC GAA TCT GTA GTA 777
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
250 255

ATT AAT TGT ACA AGA TCC AAC AAC AAT ACA AGA AAA AGT 816
Ile Asn Cys Thr Arg Ser Asn Asn Asn Thr Arg Lys Ser
260 265 270

ATA CAT ATA GGA CCA GGG AGT GCA TTT TTT GCA ACA GGA 855
Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285



GAA ATA ATA GGA GAT ATA AGA CAA GCA CAC TGT AAC CTT 894
Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu
290 295

AGT AGA ACA CAA TGG AAT AAC ACT TTA GGA AAG ATA GTC 933
Ser Arg Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val
300 305 310

ATA AAA TTA AGA GAA CAA TTT AGA AAA CAA TTT GGA GAA 972
Ile Lys Leu Arg Glu Gln Phe Arg Lys Gln Phe Gly Glu
315 320

AAA ACA ATA GTC TTT AAT CGA TCC TCA GGA GGG GAC CCG 1011
Lys Thr Ile Val Phe Asn Arg Ser Ser Gly Gly Asp Pro
325 330 335

GAA ATT GCA ATG CAC AGT TTT AAT TGT GGA GGG GAA TTT 1050
Glu Ile Ala Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350

TTC TAC TGT AAC ACA ACA GCA CTG TTT AAT AGT ACC TGG 1089
Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp
355 360

AAT GTT ACT AAA GGG TTG AAT AAC ACT GAA GGA AAT AGC 1128
Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn Ser
365 370 375

ACA GGG GAT GAA AAT ATC ATA CTC CCA TGT AGA ATA AAA 1167
Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys
380 385

CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG 1206
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
390 395 400

TAT GCC CCT CCC ATC AGT GGA CAA ATT AGA TGT TCA TCA 1245
Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser
405 410 415

AAT ATT ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT AGT 1284
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser
420 425

AAG AAC GAG AGC ATC ACC ACC GAG GTC TTC AGA CCT GGA 1323
Lys Asn Glu Ser Ile Thr Thr Glu Val Phe Arg Pro Gly
430 435 440

GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT 1362
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
445 450

AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCG 1401
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465

CCC ACC AAG GCA AAG AGA AGA GTG GTC CAG AGA GAA AAA 1440
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

AGA GCA GTG GGA ACA ATA GGA GCT ATG TTC CTT GGG TTC 1479
Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe
485 490

TTA GGA GCA TAA AGC TTC TAG A 1501
Leu Gly Ala Xaa Ser Phe Xaa
495 500

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
1 5 10 15

Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu Lys His Asn Val Trp
20 25 30

Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val
35 40 45

Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
50 55 60

Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln
65 70 75

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn Ser Ser
95 100 105

Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met Lys Gly Glu Met
110 115 120

Lys Arg Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Ile
125 130 135

Lys Lys Glu His Ala Leu Phe Tyr Arg Leu Asp Val Val Pro Ile
140 145 150

Asp Asn Asp Asn Thr Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser
155 160 165

Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro
170 175 180

Ile His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
185 190 195

Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn Val Ser Thr
200 205 210

Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
215 220 225

Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg Ser
230 235 240

Gly Asn Ile Thr Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Asn
245 250 255

Glu Ser Val Val Ile Asn Cys Thr Arg Ser Asn Asn Asn Thr Arg
260 265 270

Lys Ser Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser Arg
290 295 300

Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val Ile Lys Leu Arg
305 310 315

Glu Gln Phe Arg Lys Gln Phe Gly Glu Lys Thr Ile Val Phe Asn
320 325 330

Arg Ser Ser Gly Gly Asp Pro Glu Ile Ala Met His Ser Phe Asn
335 340 345

Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn
350 355 360

Ser Thr Trp Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn
365 370 375

Ser Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys Gln
380 385 390

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
395 400 405

Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu
410 415 420

Leu Leu Thr Arg Asp Gly Gly Ser Lys Asn Glu Ser Ile Thr Thr
425 430 435

Glu Val Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
440 445 450

Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly
455 460 465

Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe Leu Gly
485 490 495

Ala Xaa Ser Phe Xaa
500



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1514 base pairs 

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GG GAA TTC GGA TCC GGG GTA CCT GTG TGG AAG GAA GCA 38
Glu Phe Gly Ser Gly Val Pro Val Trp Lys Glu Ala
1 5 10

ACC ACC ACT CTA TTC TGT GCA TCA GAT GCT AGA GCA TAT 77
Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Arg Ala Tyr
15 20 25

GAC ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT 116
Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys
30 35

GTA CCC ACA GAC CCT AGT CCA CAA GAA GTA GTT TTG GAA 155
Val Pro Thr Asp Pro Ser Pro Gln Glu Val Val Leu Glu
40 45 50

AAT GTG ACA GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG 194
Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met
55 60

GTA GAA CAA ATG CAT GAG GAT ATA ATT AGT TTA TGG GAT 233
Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
65 70 75

CAA AGC TTA AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT 272
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys
80 85 90

GTT ACT TTA AAT TGC AGT GAT TAT AGG AAT GCT ACT GAT 311
Val Thr Leu Asn Cys Ser Asp Tyr Arg Asn Ala Thr Asp
95 100

TAT AAG AAT GCT ACT GAT ACC ACT AGT AGT AAC GAG GGA 350
Tyr Lys Asn Ala Thr Asp Thr Thr Ser Ser Asn Glu Gly
105 110 115

AAG ATG GAG AGA GGA GAA ATA AAA AAC TGC TCT TTC AAT 389
Lys Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn
120 125

ATT ACC ACA AGC ATA AAA AAT AAG ATG CAG AAA GAA TAT 428
Ile Thr Thr Ser Ile Lys Asn Lys Met Gln Lys Glu Tyr
130 135 140

GCA CTT TTC TAT AAA CTT GAT ATA GTA CCA ATA GAT AAT 467
Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp Asn
145 150 155

ACA AGC TAT ACA TTG ATA AGT TGT AAC ACC TCA GTC ATT 506
Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr Ser Val Ile
160 165

ACA CAG GCC TGT CCA AAG GTA TCC TTT GAA CCA ACT CCC 545
Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Thr Pro
170 175 180

ATA CAT TAT TGT GCT CCG GCT GGT TTT GCG ATT CTA AAG 584
Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys
185 190

TGT AAT GAT AAG AAG TTC AGT GGA AAA GGA GAA TGT AAA 623
Cys Asn Asp Lys Lys Phe Ser Gly Lys Gly Glu Cys Lys
195 200 205

AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA 662
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro
210 215 220

GTA GTA TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA 701
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230

GAA GAA GAG GTG GTA ATT AGA TCT GAC AAT TTC ATA GAC 740
Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Ile Asp
235 240 245

AAT ACT AAA ACC ATA ATA GTA CAG CTG AAA GAA TCT GTA 779
Asn Thr Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val
250 255

GAA ATT AAT TGT ATA AGA CCC AAC AAT AAT ACA AGA AAA 818
Glu Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys
260 265 270

GGT ATA CAT ATA GGA CCA GGG AGA GCA TGG TAT GCA ACA 857
Gly Ile His Ile Gly Pro Gly Arg Ala Trp Tyr Ala Thr
275 280 285

GGA GAA ATA GTA GGA GAT ATA AGA AAG GCA TAT TGT AAC 896
Gly Glu Ile Val Gly Asp Ile Arg Lys Ala Tyr Cys Asn
290 295

ATT AGT AGA ACA AAA TGG AAT AAC ACT TTA ATA CAG ATA 935
Ile Ser Arg Thr Lys Trp Asn Asn Thr Leu Ile Gln Ile
300 305 310

GCT AAC AAA TTA AAA GAA AAA TAT AAT ACA ACA ATA AGC 974
Ala Asn Lys Leu Lys Glu Lys Tyr Asn Thr Thr Ile Ser
315 320

TTT AAT CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ACG 1013
Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Thr
325 330 335

CAT AGT TTT AAT TGT GGA GGG GAG TTT TTC TAC TGT GAT 1052
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp
340 345 350

TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT TTA AAT GGT 1091
Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Leu Asn Gly
355 360

ACT TGG AAT TTT ACT GCA GGG TCA AAT GAA ACT GAA GGC 1130
Thr Trp Asn Phe Thr Ala Gly Ser Asn Glu Thr Glu Gly
365 370 375

AAT ATC ACA CTC CCA TGC AGA ATA AAA CAA ATT ATA AAC 1169
Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn
380 385

AGG TGG CAG GAA GTA GGG AAA GCA ATG TAT GCC CCT CCC 1208
Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
390 395 400

ATC AGT GGA CAA ATA AAA TGC TCA TCA AAC ATT ACA GGG 1247
Ile Ser Gly Gln Ile Lys Cys Ser Ser Asn Ile Thr Gly
405 410 415

ATG ATA TTA ACA AGG GAT GGT GGT AAC GAG AAC AAT AAT 1286
Met Ile Leu Thr Arg Asp Gly Gly Asn Glu Asn Asn Asn
420 425

GAG AGC AGT ACT ACT GAG ACC TTC AGA CCG GGA GGA GGA 1325
Glu Ser Ser Thr Thr Glu Thr Phe Arg Pro Gly Gly Gly
430 435 440

GAT ATG AGG AAC AAT TGG AGA AGT GAA TTA TAT AAA TAT 1364
Asp Met Arg Asn Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
445 450

AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC 1403
Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
455 460 465

AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA AGA GCA 1442
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala
470 475 480

GTG GGA GCG CTA GGA GCT ATG TTC CTT GGG TTC TTA GGA 1481
Val Gly Ala Leu Gly Ala Met Phe Leu Gly Phe Leu Gly
485 490

GCA TAA AGC TTC TAG ACC GAC TCT AGA GGA TCC 1514
Ala Xaa Ser Phe Xaa Thr Asp Ser Arg Gly Ser
495 500 504
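
As an illustrative aside that is not part of the original listing, the pairing of each nucleotide row with the amino acid row beneath it can be spot-checked with a short script. The sketch below assumes the standard genetic code and follows the listing's convention of writing stop codons as Xaa. The names CODON_TABLE and translate, and the offset parameter, are hypothetical helpers introduced only for this example; the offset of 2 reflects the two leading bases (GG) that precede the first full codon of SEQ ID NO: 13.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the listing uses the standard code; '*' marks a stop codon).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")

# One-letter to three-letter codes; stops are written Xaa, as in the listing.
THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Xaa",
}

CODON_TABLE = {
    "".join(codon): THREE[aa]
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate a spaced or unspaced DNA string into three-letter codes.

    `offset` bases are skipped before the first codon; any trailing
    partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()[offset:]
    usable = len(seq) - len(seq) % 3
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return " ".join(CODON_TABLE[c] for c in codons)


# First and last rows of SEQ ID NO: 13 as printed above.
print(translate("GG GAA TTC GGA TCC GGG GTA CCT GTG TGG AAG GAA GCA", offset=2))
print(translate("GCA TAA AGC TTC TAG ACC GAC TCT AGA GGA TCC"))
```

The same arithmetic accounts for the stated lengths: 2 leading bases plus 504 codons of 3 bases each gives the 1514 base pairs listed for SEQ ID NO: 13 and the 504 residues listed for SEQ ID NO: 14.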

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 amino acids 
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Glu Phe Gly Ser Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10 15

Leu Phe Cys Ala Ser Asp Ala Arg Ala Tyr Asp Thr Glu Val His
20 25 30

Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Ser Pro
35 40 45

Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp
50 55 60

Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu
65 70 75

Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys
80 85 90

Val Thr Leu Asn Cys Ser Asp Tyr Arg Asn Ala Thr Asp Tyr Lys
95 100 105

Asn Ala Thr Asp Thr Thr Ser Ser Asn Glu Gly Lys Met Glu Arg
110 115 120

Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys
125 130 135

Asn Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Ile
140 145 150

Val Pro Ile Asp Asn Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr
155 160 165

Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Thr
170 175 180

Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
185 190 195

Asn Asp Lys Lys Phe Ser Gly Lys Gly Glu Cys Lys Asn Val Ser
200 205 210

Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
215 220 225

Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg
230 235 240

Ser Asp Asn Phe Ile Asp Asn Thr Lys Thr Ile Ile Val Gln Leu
245 250 255

Lys Glu Ser Val Glu Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Gly Ile His Ile Gly Pro Gly Arg Ala Trp Tyr Ala Thr
275 280 285

Gly Glu Ile Val Gly Asp Ile Arg Lys Ala Tyr Cys Asn Ile Ser
290 295 300

Arg Thr Lys Trp Asn Asn Thr Leu Ile Gln Ile Ala Asn Lys Leu
305 310 315

Lys Glu Lys Tyr Asn Thr Thr Ile Ser Phe Asn Arg Ser Ser Gly
320 325 330

Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu
335 340 345

Phe Phe Tyr Cys Asp Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn
350 355 360

Leu Asn Gly Thr Trp Asn Phe Thr Ala Gly Ser Asn Glu Thr Glu
365 370 375

Gly Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg
380 385 390

Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly
395 400 405

Gln Ile Lys Cys Ser Ser Asn Ile Thr Gly Met Ile Leu Thr Arg
410 415 420

Asp Gly Gly Asn Glu Asn Asn Asn Glu Ser Ser Thr Thr Glu Thr
425 430 435

Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn Trp Arg Ser Glu
440 445 450

Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465

Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala
470 475 480

Val Gly Ala Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Xaa
485 490 495

Ser Phe Xaa Thr Asp Ser Arg Gly Ser
500 504



(2) INFORMATION FOR SEQ ID NO:15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1408 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

G GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA TTC 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AGA GCA TAT GAC ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Arg Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC CCT 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AGT CCA CAA GAA GTA TTT TTG GGA AAT GTG ACA GAA AAT 154
Ser Pro Gln Glu Val Phe Leu Gly Asn Val Thr Glu Asn
40 45 50

TTT AAT ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG TAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Tyr
55 60

GAG GAT ATA ATT AGT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

AGT GAT TAT AGG AAT GCT ACT GAT TAT AAG AAT GCT ACT 310
Ser Asp Tyr Arg Asn Ala Thr Asp Tyr Lys Asn Ala Thr
95 100

GAT ACC ACT AGT AGT AAC GAG GGA AAG ATG GAG AGA GGA 349
Asp Thr Thr Ser Ser Asn Glu Gly Lys Met Glu Arg Gly
105 110 115

GAA ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA AGC ATA 388
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile
120 125

AAA AAT AAG ATG CAG AAA GAA TAT GCA CTT TTC TAT AAA 427
Lys Asn Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys
130 135 140

CTT AAT ATA GTA CCA ATA GAT AAT ACA AGC TAT ACA TTG 466
Leu Asn Ile Val Pro Ile Asp Asn Thr Ser Tyr Thr Leu
145 150 155

ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165

AAG GTA TCC TTT GAA CCA ATT CCC ATA CAT TAT TGT GCT 544
Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
185 190

TTC AGT GGA AAA GGA GAA TGT AAA AAT GTC AGC ACA GTA 622
Phe Ser Gly Lys Gly Glu Cys Lys Asn Val Ser Thr Val
195 200 205

CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA 661
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220

CTG CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG GTG GTA 700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230

ATT AGA TCT GAC AAT TTC ACA GAC AAT ACT AAA ACC ATA 739
Ile Arg Ser Asp Asn Phe Thr Asp Asn Thr Lys Thr Ile
235 240 245

ATA GTA CAG CTG AAA GAA TCT GTA GAA ATT AAT TGT ATA 778
Ile Val Gln Leu Lys Glu Ser Val Glu Ile Asn Cys Ile
250 255

AGA CCC AAC AAT AAT ACA AGA AAA GGT ATA CAT ATA GGA 817
Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Ile Gly
260 265 270

CCA GGG AGA GCA TGG TAT GCA ACA GGA GAA ATA GTA GGA 856
Pro Gly Arg Ala Trp Tyr Ala Thr Gly Glu Ile Val Gly
275 280 285

GAT ATA AGA CAG GCA TAT TGT AAC ATT AGT AGA ACA AAA 895
Asp Ile Arg Gln Ala Tyr Cys Asn Ile Ser Arg Thr Lys
290 295

TGG AAT AAC ACT TTA ATA CAG ATA GCT AAC AAA TTA AAA 934
Trp Asn Asn Thr Leu Ile Gln Ile Ala Asn Lys Leu Lys
300 305 310

GAA AAA TAT AAT ACA ACA ATA AGC TTT AAT CGA TCC TCA 973
Glu Lys Tyr Asn Thr Thr Ile Ser Phe Asn Arg Ser Ser
315 320

GGA GGG GAC CCA GAA ATT GTA ACC CAT AGT TTT AAT TGT 1012
Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys
325 330 335

GGA GGG GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT 1051
Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe
340 345 350

AAT AGT ACT TGG AAT TTA AAT GGT ACT TGG AAT TTT ACT 1090
Asn Ser Thr Trp Asn Leu Asn Gly Thr Trp Asn Phe Thr
355 360

GCA GGG TCA AAT GAA ACT GAA GGC AAT ATC ACA CTC CCA 1129
Ala Gly Ser Asn Glu Thr Glu Gly Asn Ile Thr Leu Pro
365 370 375

TGC AGA ATA AAA CAA ATT ATA AAC AGG TGG CAG GAA GTA 1168
Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val
380 385

GGA AAA GCA ATG TAT GCC CCT CCC ATC AGT GGA CAA ATA 1207
Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile
390 395 400

AGA TGC TCA TCA AAC ATT ACA GGG ATG ATA TTA ACA AGG 1246
Arg Cys Ser Ser Asn Ile Thr Gly Met Ile Leu Thr Arg
405 410 415

GAT GGT GGT AAC GAG AAC AAT AAT GAG AGC AGT ACT ACT 1285
Asp Gly Gly Asn Glu Asn Asn Asn Glu Ser Ser Thr Thr
420 425

GAG ACC TTC AGA CCG GGA GGA GGA GAT ATG AGG AAC AAT 1324
Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn
430 435 440

TGG AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT 1363
Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
445 450

GAG CCA TTA GGA GTA GCA CCC ACC GAC TCT AGA GGA TCC 1402
Glu Pro Leu Gly Val Ala Pro Thr Asp Ser Arg Gly Ser
455 460 465

TCT AGA 1408
Ser Arg
469



(2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 469 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser
1 5 10 15

Asp Ala Arg Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr
20 25 30

His Ala Cys Val Pro Thr Asp Pro Ser Pro Gln Glu Val Phe Leu
35 40 45

Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
50 55 60

Glu Gln Met Tyr Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

Ser Asp Tyr Arg Asn Ala Thr Asp Tyr Lys Asn Ala Thr Asp Thr
95 100 105

Thr Ser Ser Asn Glu Gly Lys Met Glu Arg Gly Glu Ile Lys Asn
110 115 120

Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asn Lys Met Gln Lys
125 130 135

Glu Tyr Ala Leu Phe Tyr Lys Leu Asn Ile Val Pro Ile Asp Asn
140 145 150

Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
155 160 165

Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
170 175 180

Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe
185 190 195

Ser Gly Lys Gly Glu Cys Lys Asn Val Ser Thr Val Gln Cys Thr
200 205 210

His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
215 220 225

Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Thr
230 235 240

Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val Glu
245 250 255

Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His
260 265 270

Ile Gly Pro Gly Arg Ala Trp Tyr Ala Thr Gly Glu Ile Val Gly
275 280 285

Asp Ile Arg Gln Ala Tyr Cys Asn Ile Ser Arg Thr Lys Trp Asn
290 295 300

Asn Thr Leu Ile Gln Ile Ala Asn Lys Leu Lys Glu Lys Tyr Asn
305 310 315

Thr Thr Ile Ser Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile
320 325 330

Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
335 340 345

Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Leu Asn Gly Thr Trp
350 355 360

Asn Phe Thr Ala Gly Ser Asn Glu Thr Glu Gly Asn Ile Thr Leu
365 370 375

Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly
380 385 390

Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser
395 400 405

Ser Asn Ile Thr Gly Met Ile Leu Thr Arg Asp Gly Gly Asn Glu
410 415 420

Asn Asn Asn Glu Ser Ser Thr Thr Glu Thr Phe Arg Pro Gly Gly
425 430 435

Gly Asp Met Arg Asn Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
440 445 450

Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Asp Ser Arg
455 460 465

Gly Ser Ser Arg
469



(2) INFORMATION FOR SEQ ID NO: 17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1499 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GAG GTA CCT GTG TGG AAA GAA GCA ACC ACT ACT CTA 36
Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1 5 10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GGG GTG 75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Gly Val
15 20 25

CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC 114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
30 35

CCC AAC CCA CAA GAA ATA GAA TTG GTA AAT GTG ACA GAA 153
Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu
40 45 50

GAT TTT AAC ATG TGG AAA AAT AAA ATG GTA GAC CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Lys Met Val Asp Gln Met
55 60

CAT GAG GAT ATA ATC AGT TTA TGG GAT GAA AGC CTA AAG 231
His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser Leu Lys
65 70 75

CCA TGT GTA AAG TTA ACC CCA CTT TGT GTT ACT CTA AAC 270
Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
80 85 90

TGC AGT GAT GTG AAC AAT TCC ACA AAT CCT AAT GAT ACT 309
Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr
95 100

AAT ACT AAT TCC ACT AAT ACT ACT TCC TCT ACT CCT ACG 348
Asn Thr Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr
105 110 115

GCC ACT ACT AGT AGC GAG GAA AAG ATG GAG AAG GGA GAA 387
Ala Thr Thr Ser Ser Glu Glu Lys Met Glu Lys Gly Glu
120 125

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA CAC ATG AAA 426
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr His Met Lys
130 135 140

GAT AAG GCA CAG AAA GAA TAT GCA CTT TTT TAT AAA CTT 465
Asp Lys Ala Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
145 150 155

GAT ATA GTA CCA ATA GAT GAT AAT AAT GCC AGC TAT AGG 504
Asp Ile Val Pro Ile Asp Asp Asn Asn Ala Ser Tyr Arg
160 165

TTG ATA AGT TGT AAT ACC TCA GAC ATT ACA CAG GCC TGT 543
Leu Ile Ser Cys Asn Thr Ser Asp Ile Thr Gln Ala Cys
170 175 180

CCA AAG GTG ACC TTT GAG CCA ATT CCC ATA CAT TAT TGT 582
Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys
185 190

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG 621
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys
195 200 205

AAG TTC AAT GGA ACA GGA CCA TGT TCA AAG GTC AGC ACA 660
Lys Phe Asn Gly Thr Gly Pro Cys Ser Lys Val Ser Thr
210 215 220

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT 699
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230

CAA CTG TTG TTA AAT GGC AGT CTT GCA GAA GAA GAA GTA 738
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
235 240 245

GTA ATT AGA TCT GTC AAT TTC ACA GAC AAT GCT AAA ATC 777
Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Ile
250 255

ATA ATA GTA CAG CTG AAA GAA CCT GTA GCA ATT AAT TGT 816
Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile Asn Cys
260 265 270

ACA AGA CCC AAC AAC AAT ACA AGA AAA GGT ATA CAT CTA 855
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
275 280 285

GGA CCA GGG AGC ACA TTT TAT ACA ACA GGA GAA ATA ATA 894
Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile
290 295

GGA GAC ATA AGA AAA GCA TAT TGC AAG ATT AGT AAA GAA 933
Gly Asp Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu
300 305 310

AAA TGG AAT AAC ACT TTA AGA CAG GTA GTT AAA AAA TTA 972
Lys Trp Asn Asn Thr Leu Arg Gln Val Val Lys Lys Leu
315 320

AGA GAA CAA TTT GGG AAT AAA ACA ATA ATT TTT AAT CGA 1011
Arg Glu Gln Phe Gly Asn Lys Thr Ile Ile Phe Asn Arg
325 330 335

TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT 1050
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
340 345 350

AAC TGT GGA GGG GAG TTT TTC TAC TGT AAT ACA ACA CAA 1089
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
355 360

CTG TTT AAT AGT ACT TGG AAT AAT ACT GAA GGG ACA AAT 1128
Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr Asn
365 370 375

AGC ACT GAA GGA AAT AGC ACA ATC ACA CTC CCA TGC AGA 1167
Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg
380 385

ATA AAA CAA ATT ATA AAT ATG TGG CAG GAA GTA GGA AAA 1206
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
390 395 400

GCA ACG TAT GCC CCT CCC ATC AGA GGA CGA ATT AGA TGC 1245
Ala Thr Tyr Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys
405 410 415

ATA TCA AAT ATT ACA GGA CTG CTA TTA ACA AGA GAT GGT 1284
Ile Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425

GGT AGG AAT GTC ACA AAC AAT ACC GAA ACC TTC AGA CCT 1323
Gly Arg Asn Val Thr Asn Asn Thr Glu Thr Phe Arg Pro
430 435 440

GGA GGA GGA GAC ATG AGG GAC AAT TGG AGA AGT GAA TTA 1362
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
445 450

TAT AAA TAT AAA GTA GTA AAA GTT GAA CCA TTA GGA ATA 1401
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile
455 460 465

GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAC AGA GAC 1440
Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
470 475 480

AAA AGA GCA GCA CTA GGA GCC TTG TTC CTT GGG TTC TTA 1479
Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu
485 490

GGA GCA TAA AAG CTT CTA GA 1499
Gly Ala Xaa Lys Leu Leu
495 499

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 499 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala
1 5 10 15

Ser Asp Ala Lys Ala Tyr Asp Thr Gly Val His Asn Val Trp Ala
20 25 30

Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Glu
35 40 45

Leu Val Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn Lys Met
50 55 60

Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser
65 70 75

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
80 85 90

Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr Asn Thr
95 100 105

Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr Ala Thr Thr Ser
110 115 120

Ser Glu Glu Lys Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe
125 130 135

Asn Ile Thr Thr His Met Lys Asp Lys Ala Gln Lys Glu Tyr Ala
140 145 150

Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp Asp Asn Asn Ala
155 160 165

Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Asp Ile Thr Gln Ala
170 175 180

Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
185 190 195

Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn
200 205 210

Gly Thr Gly Pro Cys Ser Lys Val Ser Thr Val Gln Cys Thr His
215 220 225

Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
230 235 240

Leu Ala Glu Glu Glu Val Val Ile Arg Ser Val Asn Phe Thr Asp
245 250 255

Asn Ala Lys Ile Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile
260 265 270

Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
275 280 285

Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp
290 295 300

Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu Lys Trp Asn Asn
305 310 315

Thr Leu Arg Gln Val Val Lys Lys Leu Arg Glu Gln Phe Gly Asn
320 325 330

Lys Thr Ile Ile Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile
335 340 345

Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
350 355 360

Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr
365 370 375

Asn Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg Ile
380 385 390

Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Thr Tyr
395 400 405

Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys Ile Ser Asn Ile Thr
410 415 420

Gly Leu Leu Leu Thr Arg Asp Gly Gly Arg Asn Val Thr Asn Asn
425 430 435

Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
440 445 450

Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu
455 460 465

Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
470 475 480

Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala
485 490 495

Xaa Lys Leu Leu
499

(2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1499 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

GAG GTA CCT GTA TGG AAA GAA GCA ACC ACT ACT CTA 36 
Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu 
20 1 5 10 

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG 7 5 
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
15 20 25 






CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC 114 
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp 
30 35 



CCC AAC CCA CAA GAA ATA GAA TTG GTA AAT GTG ACA GAA 153
Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu
40 45 50

GAT TTT AAC ATG TGG AAA AAT AAA ATG GTA GAC CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Lys Met Val Asp Gln Met
55 60

CAT GAG GAT ATA ATC AGT TTA TGG GAT GAA AGC CTA AAG 231
His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser Leu Lys
65 70 75

CCA TGT GTA AAG TTA ACC CCA CTT TGT GTT ACT CTA AAC 270
Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
80 85 90



TGC AGT GAT GTG AAC AAT TCC ACA AAT CCT AAT GAT ACT 309 
Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr 
95 100 



AAT ACT AAT TCC ACT AAT ACT ACT TCC TCT ACT CCT ACG 348
Asn Thr Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr
105 110 115

GCC ACT ACT AGT AGC GAG GAA AAG ATG GAG AAG GGA GAA 387
Ala Thr Thr Ser Ser Glu Glu Lys Met Glu Lys Gly Glu
120 125

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA CAC ATG AAA 426
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr His Met Lys
130 135 140

GAT AAG GTA CAG AAA GAA TAT GCA CTT TTT TAT AAA CTT 465
Asp Lys Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
145 150 155

GAT ATA GTA CCA ATA GAT GAT AAT AAT ACC AGC TAT AGG 504
Asp Ile Val Pro Ile Asp Asp Asn Asn Thr Ser Tyr Arg
160 165 

TTG ATA ACT TGT AAT ACC TCA GTC ATT ACA CAG GCC TGT 543 
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
170 175 180

CCA ATG GTG ACC TTT GAG CCA ATT CCC ATA CAT TAT TGT 582
Pro Met Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys
185 190

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG 621
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys
195 200 205



AAG TTC AAT GGA ACA GGA CCA TGT TCA AAG GTC AGC ACA 660 
Lys Phe Asn Gly Thr Gly Pro Cys Ser Lys Val Ser Thr 
210 215 220 

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA CTA TCA ACT 699 
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230

CAA CTG TTG TTA AAT GGC AGT CTT GCA GAA GAA CAA GTA 738
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
235 240 245

GTA ATT AGA TCT GTC AAT TTC ACA CAC AAT CCT AAA ATC 777
Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Ile
250 255

ATA ATA GTA CAG CTG AAA GAA CCT GTA GCA ATT AAT TGT 816
Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile Asn Cys
260 265 270



ACA AGA CCC AAC AAC AAT ACA AGA AAA GGT ATA CAT CTA 855
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
275 280 285

GGA CCA GGG AGC ACA TTT TAT ACA ACA GGA GAA ATA ATA 894
Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile
290 295

GGA GAC ATA AGA AAA GCA TAT TGC AAG ATT AGT AAA GAA 933
Gly Asp Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu
300 305 310

AAA TGG AAT AAC ACT TTA AGA CAG GTA GTT AAA AAA TTA 972
Lys Trp Asn Asn Thr Leu Arg Gln Val Val Lys Lys Leu
315 320

AGA GAA CAA TTT GGG AAT AAA ACA ATA ATT TTT AAT CGA 1011
Arg Glu Gln Phe Gly Asn Lys Thr Ile Ile Phe Asn Arg
325 330 335

TCC TCA GGA GGG GAC CCA GAA ATT CTA ATG CAC ACT TTT 1050
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
340 345 350



AAC TGT GGA GGG GAG TTT TTC TAC TGT AAT ACA ACA CAA 1089 
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
355 360

CTG TTT AAT AGT ACT TGG AAT AAT ACT GAA GGG ACA AAT 1128
Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr Asn
365 370 375

AGC ACT GAA GGA AAT AGC ACA ATC ACA CTC CCA TGC AGA 1167
Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg
380 385

ATA AAA CAA ATT ATA AAT ATG TGG CAG GAA GTA GGA AAA 1206
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
390 395 400

GCA ACG TAT GCC CCT CCC ATC AGA GGA CGA ATT AGA TGC 1245
Ala Thr Tyr Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys
405 410 415

ATA TCA AAT ATT ACA GGA CTG CTA TTA ACA AGA GAT GGT 1284
Ile Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425






GGT AGG AAT GTC ACA AAC AAT ACC GAN NCC TTC AGA CCT 1323
Gly Arg Asn Val Thr Asn Asn Thr Xaa Xaa Phe Arg Pro
430 435 440

GGA GGA GGA GAC ATG AGG GAC AAT TGG AGA AGT GAA TTA 1362
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
445 450

TAT AAA TAT AAA GTA GTA AAA GTT GAA CCA TTA GGA ATA 1401
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile
455 460 465

GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAC AGA GAC 1440
Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
470 475 480

AAA AGA GCA GCA CTA GGA GCT TTG TTC CTT GGG TTC TTA 1479 
Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu 
485 490 



GGA GCA TAA AAG CTT CTA GA 1499 
Gly Ala Xaa Lys Leu Leu 
495 499 



(2) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 499 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala
1 5 10 15

Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala
20 25 30



Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Glu
35 40 45

Leu Val Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn Lys Met
50 55 60

Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser
65 70 75

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
80 85 90

Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr Asn Thr
95 100 105

Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr Ala Thr Thr Ser
110 115 120

Ser Glu Glu Lys Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe
125 130 135






Asn Ile Thr Thr His Met Lys Asp Lys Val Gln Lys Glu Tyr Ala
140 145 150

Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp Asp Asn Asn Thr
155 160 165

Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala
170 175 180

Cys Pro Met Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
185 190 195

Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn
200 205 210

Gly Thr Gly Pro Cys Ser Lys Val Ser Thr Val Gln Cys Thr His
215 220 225

Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
230 235 240

Leu Ala Glu Glu Glu Val Val Ile Arg Ser Val Asn Phe Thr Asp
245 250 255

Asn Ala Lys Ile Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile
260 265 270

Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
275 280 285

Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp
290 295 300

Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu Lys Trp Asn Asn
305 310 315

Thr Leu Arg Gln Val Val Lys Lys Leu Arg Glu Gln Phe Gly Asn
320 325 330

Lys Thr Ile Ile Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile
335 340 345

Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
350 355 360

Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr
365 370 375

Asn Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg Ile
380 385 390

Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Thr Tyr
395 400 405

Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys Ile Ser Asn Ile Thr
410 415 420

Gly Leu Leu Leu Thr Arg Asp Gly Gly Arg Asn Val Thr Asn Asn
425 430 435

Thr Xaa Xaa Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
440 445 450

Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu
455 460 465






Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
470 475 480

Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala
485 490 495

Xaa Lys Leu Leu
499

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1475 base pairs 
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

G GTA CCT GTG TGG AAA GAA GCA AAC ACA ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AAA GCA TAT GAT AGA GAA GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Arg Glu Val His
15 20 25

AAT GTT TGG GCA ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AAC CCA CAA GAA ATA GTA TTG GGA AAT GTG ACA GAA AAT 154
Asn Pro Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn
40 45 50



TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG CAT 193 
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AAT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAG TTA ACT CCA CTC TGT GTT ACT TTA AAG TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
80 85 90

AAG GAT CTG GAG AGG AAT ACT ACC TAT AAT AGC ACT ATT 310
Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile
95 100

ACC AAT AAT AGT AGT TTG GAG GGA CTA AGA GAA CAA ATG 349
Thr Asn Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met
105 110 115

ACA AAC TGC TCT TTC AAC ATC ACC ACA AGT ATA AGA GAT 388
Thr Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp
120 125

AAG GTG CAG AAA GAA TAT GCA CTT TTG TAT AAA CTT GAT 427
Lys Val Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp
130 135 140

GTA GTA CCA ATA GAA GAA GAT GAC AAT ACT AGC TAT AGA 466
Val Val Pro Ile Glu Glu Asp Asp Asn Thr Ser Tyr Arg
145 150 155

TTG ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCT TGT 505
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
160 165

CCA AAG ACA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT 544 

Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
170 175 180

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG 583
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
185 190

AAG TTC AAT GGA ACA GGA CCA TGT AAA AAT CTC ACC ACA 622
Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
195 200 205

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT 661
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
210 215 220

CAA CTG TTG TTA AAT GGC AGT CTA GCA GAA GAA GAC GTA 700
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
225 230

GTA ATC AGA TCT GCC AAT TTC ACA GAC AAT GCT AAA ACC 739
Val Ile Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr
235 240 245

ATA ATA GTA CAT CTA AAT GAA ACT GTA AAA ATT AAT TGT 778
Ile Ile Val His Leu Asn Glu Thr Val Lys Ile Asn Cys
250 255

ACA AGA CTT GGC AAC AAT ACA AGA AAA AGT ATA AAT ATA 817
Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile Asn Ile
260 265 270

GGA CCA GGG AGA GTA CTC TAT GCA ACA GGA GAA ATA ATA 856
Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
275 280 285

GGA GAC ATA AGA CAA GCA CAT TGT AAC ATT AGT AGA GCA 895
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala
290 295

CAA TGG AAT AAG ACT TTA GAA AAG CTA GTT GAC AAA TTA 934 
Gln Trp Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu
300 305 310

AGA AAA CAA TTT GGG GAT AAT ACA ACA ATA GCT TTT AAT 973
Arg Lys Gln Phe Gly Asp Asn Thr Thr Ile Ala Phe Asn
315 320

CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC ACT 1012
Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr
325 330 335

TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA 1051
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
340 345 350

CAA CTG TTT AAT AGT ACT TGG AAT AAT ACT TGG AAG GAT 1090
Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Trp Lys Asp
355 360

CCT AAC AGG AGT GAC AAT ATC ACA CTC CCA TGC AGA ATA 1129
Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg Ile
365 370 375

AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA 1168
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
380 385

ATG TAC GCC CCT CCC ATC AGA GGG GAA ATT AGA TGT TCA 1207
Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser
390 395 400

TCA AAT ATC ACA GGG CTG CTA CTA ACA ACA GAT GGT GGT 1246
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
405 410 415

AAT GAC GAT GGT AAT GAC ACG ACC ACA AAC AGG ACC GAG 1285
Asn Asp Asp Gly Asn Asp Thr Thr Thr Asn Arg Thr Glu
420 425

ATC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1324
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
430 435 440

AGA AGT GAA TTA TAT AGA TAT AAA GTA GTA AAA ATT GAA 1363
Arg Ser Glu Leu Tyr Arg Tyr Lys Val Val Lys Ile Glu
445 450

CCA TTA GGA ATA GCA CCC ACC AGG GCA AAG AGA AGA GTG 1402
Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg Arg Val
455 460 465

GTG CAG AGA GAA AAA AGA GCA GTA GGA CTA GGA GCT TTG 1441
Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
470 475 480

TTC CTT GGG T TCTTAGGAG CATAAAGCTT CTAGA 1475
Phe Leu Gly
483

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 491 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser
1 5 10 15

Asp Ala Lys Ala Tyr Asp Arg Glu Val His Asn Val Trp Ala Thr 
20 25 30 




His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Val Leu
35 40 45

Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
50 55 60

Glu Gln Met His Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu
65 70 75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
80 85 90

Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile Thr Asn
95 100 105

Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met Thr Asn Cys Ser
110 115 120

Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr
125 130 135

Ala Leu Leu Tyr Lys Leu Asp Val Val Pro Ile Glu Glu Asp Asp
140 145 150

Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
155 160 165

Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr
170 175 180

Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
185 190 195

Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys
200 205 210

Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn
215 220 225

Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Ala Asn Phe
230 235 240

Thr Asp Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu Thr Val
245 250 255

Lys Ile Asn Cys Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile
260 265 270

Asn Ile Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
275 280 285

Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Gln Trp
290 295 300

Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu Arg Lys Gln Phe
305 310 315

Gly Asp Asn Thr Thr Ile Ala Phe Asn Arg Ser Ser Gly Gly Asp
320 325 330

Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu Phe Phe
335 340 345

Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr
350 355 360

Trp Lys Asp Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg
365 370 375

Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
380 385 390

Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser Ser Asn Ile
395 400 405

Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asp Asp Gly Asn
410 415 420






Asp Thr Thr Thr Asn Arg Thr Glu Ile Phe Arg Pro Gly Gly Gly
425 430 435

Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Arg Tyr Lys Val
440 445 450

Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg
455 460 465

Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
470 475 480

Phe Leu Gly Phe Leu Gly Ala Leu Phe Leu Gly
485 490 491

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1475 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:

G GTA CCT GTG TGG AAA GAA GCA AAC ACA ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AAA GCA TAT GAT AGA GAA GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Arg Glu Val His
15 20 25

AAT GTT TGG GCA ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AAC CCA CAA GAA ATA GTA TTG GGA AAT GTG ACA GAA AAT 154
Asn Pro Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG CAT 193 
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AAT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAG TTA ACT CCA CTC TGT GTT ACT TTA AAG TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
80 85 90

AAG GAT CTG GAG AGG AAT ACT ACC TAT AAT AGC ACT ATT 310
Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile
95 100

ACC AAT AAT AGT AGT TTG GAG GGA CTA AGA GAA CAA ATG 349
Thr Asn Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met
105 110 115

ACA AAC TGC TCT TTC AAC ATC ACC ACA AGT ATA AGA GAT 388
Thr Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp
120 125

AAG GTG CAG AAA GAA TAT GCA CTT TTG TAT AAA CTT GAT 427
Lys Val Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp
130 135 140






GTA GTA CCA ATA GAA GAA GAT GAC AAT ACT ACC TAT AGA 466
Val Val Pro Ile Glu Glu Asp Asp Asn Thr Ser Tyr Arg
145 150 155

TTG ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCT TGT 505
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
160 165

CCA AAG ACA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT 544
Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
170 175 180






GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG 583 
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
185 190

AAG TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGC ACA 622
Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
195 200 205

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT 661
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
210 215 220

CAA CTG TTG TTA AAT GCC AGT CTA CCA GAA GAA GAG CTA 700
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
225 230

GTA ATC AGA TCT GCC AAT TTC ACA GAC AAT GCT AAA ACC 739
Val Ile Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr
235 240 245

ATA ATA GTA CAT CTA AAT GAA ACT GTA AAA ATT AAT TGT 778
Ile Ile Val His Leu Asn Glu Thr Val Lys Ile Asn Cys
250 255

ACA AGA CTT GGC AAC AAT ACA AGA AAA AGT ATA AAT ATA 817
Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile Asn Ile
260 265 270

GGA CCA GGG AGA GTA CTC TAT GCA ACA GGA GAA ATA ATA 856
Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
275 280 285

GGA GAC ATA AGA CAA GCA CAT TGT AAC ATT AGT AGA GCA 895
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala
290 295

CAA TGG AAT AAG ACT TTA GAA AAG GTA GTT GAC AAG TTA 934
Gln Trp Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu
300 305 310

AGA AAA CAA TTT GGG GAT AAT ACA ACA ATA GCT TTT AAT 973
Arg Lys Gln Phe Gly Asp Asn Thr Thr Ile Ala Phe Asn
315 320

CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC ACT 1012
Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr
325 330 335

TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA 1051
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
340 345 350

CAA CTG TTT AAT AGT ACT TGG AAT AAT ACT TGG AAG GAT 1090
Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Trp Lys Asp
355 360






CCT AAC AGG AGT GAC AAT ATC ACA CTC CCA TGC AGA ATA 1129
Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg Ile
365 370 375

AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA 1168
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
380 385

ATG TAC GCC CCT CCC ATC AGA GGG GAA ATT AGA TGT TCA 1207
Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser
390 395 400

TCA AAT ATC ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT 1246
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
405 410 415

AAT GAC GAT GGT AAT GAC ACG ACC ACA AAC AGG ACC GAG 1285
Asn Asp Asp Gly Asn Asp Thr Thr Thr Asn Arg Thr Glu
420 425

ATC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1324
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
430 435 440

AGA AGT GAA TTA TAT AGA TAT AAA GTA GTA AAA ATT GAA 1363
Arg Ser Glu Leu Tyr Arg Tyr Lys Val Val Lys Ile Glu
445 450

CCA TTA GGA ATA GCA CCC ACC AGG GCA AAG AGA AGA GTG 1402
Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg Arg Val
455 460 465

GTG CAG AGA GAA AAA AGA GCA GTA GGA CTA GGA GCT TTG 1441
Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
470 475 480

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1475
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
485 490 491

(2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 491 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser
1 5 10 15

Asp Ala Lys Ala Tyr Asp Arg Glu Val His Asn Val Trp Ala Thr
20 25 30

His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Val Leu
35 40 45

Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
50 55 60

Glu Gln Met His Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu
65 70 75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
80 85 90






Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile Thr Asn
95 100 105

Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met Thr Asn Cys Ser
110 115 120

Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr
125 130 135

Ala Leu Leu Tyr Lys Leu Asp Val Val Pro Ile Glu Glu Asp Asp
140 145 150

Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
155 160 165

Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr
170 175 180

Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
185 190 195

Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys
200 205 210

Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn
215 220 225

Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Ala Asn Phe
230 235 240

Thr Asp Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu Thr Val
245 250 255

Lys Ile Asn Cys Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile
260 265 270

Asn Ile Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
275 280 285

Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Gln Trp
290 295 300

Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu Arg Lys Gln Phe
305 310 315



Gly Asp Asn Thr Thr Ile Ala Phe Asn Arg Ser Ser Gly Gly Asp
320 325 330

Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu Phe Phe
335 340 345

Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr
350 355 360

Trp Lys Asp Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg
365 370 375

Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
380 385 390

Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser Ser Asn Ile
395 400 405

Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asp Asp Gly Asn
410 415 420

Asp Thr Thr Thr Asn Arg Thr Glu Ile Phe Arg Pro Gly Gly Gly
425 430 435

Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Arg Tyr Lys Val
440 445 450

Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg
455 460 465

Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
470 475 480

Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
485 490 491

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1435 base pairs 
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAT TCA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu
15 20 25

GCA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA 114
Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAC CCC AAC CCA CAA GAA GTA GAA TTG GAA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG 192

Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GGG GAT ATA ATT ACT TTA TCG GAT CAA AGC CTA 231
Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACC TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT GAC CCA AAT GTT ACT AAT AGC GAG AGA ACG 309
Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr
95 100

ATA GAG GGG GGA GAA ATA AAA AAT TGC TCT TTC AAT ATC 348
Ile Glu Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile
105 110 115

ACC ACA AAC ATA AGA GAT AGG TTT CAG AAA GAA TAT GCA 387
Thr Thr Asn Ile Arg Asp Arg Phe Gln Lys Glu Tyr Ala
120 125

CTT TTT TAT AAA CTT GAT GTA ATA CCA TTA GGT AAT GAT 426
Leu Phe Tyr Lys Leu Asp Val Ile Pro Leu Gly Asn Asp
130 135 140

AAT ACT AGC TAT AGG TTG ATA AGT TGT AAC ACC TCA GTC 465
Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
145 150 155

ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT 504
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
160 165

CCC ATA CAT TAT TGT GCC CCG GCT GGT TTT CCG ATT CTA 543
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
170 175 180

AAG TGT AAA GAT AAG AAG TTC AAT GGA ACA GGA CCA TGT 582
Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys
185 190

ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AAG 621
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
195 200 205

CCA GTA GTA TCA ACT CAA CTG TTG TTA AAT GGC ACT CTA 660
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
210 215 220

GCA GAA GAA GAC ATA GTA ATT AGA TCC GCC AAT CTC ACA 699
Ala Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Leu Thr
225 230

GAC AAT GCT AAA AAC ATA ATA GTA CAG CTG AAT GAA TCT 738
Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Glu Ser
235 240 245

GTA ACA ATG AAT TGT ACA AGA CCC AAC AAC AAT ACA ATG 777
Val Thr Met Asn Cys Thr Arg Pro Asn Asn Asn Thr Met
250 255



AAA AGT ATA CAT ATA GGA CCA GGC ACA CCA TTT TAT GCA 816
Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
260 265 270

ACA GGA AAC ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT 855
Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
275 280 285

AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG 894
Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys
290 295

ATA GCT ATA AAA TTA AGA GAA CAA TTT AAT AAG ACA ATA 933
Ile Ala Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile
300 305 310

GTC TTT AAT CAA TCC TCA GGA GGG GAC CCA GAA ATT GCA 972
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ala
315 320

ACG CTC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT 1011
Thr Leu Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
325 330 335






AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AGT ACT 1050 
Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr
340 345 350

GGG TCA AAT AAC ACT AAA GGA AAT GAC ACA ATC ACA CTC 1089
Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr Ile Thr Leu
355 360

CCA TGC AGA ATA AGA CAA ATT ATA AAC ATG TGG CAG AAA 1128
Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Lys
365 370 375

ATA GGA AAA GCA ATG TAT GCC CCT CCC ATC AAA GGG CAA 1167
Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln
380 385

ATT AGA TGT TCA TCA AAT ATT ACA GGG CTA ATA TTA ACA 1206
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr
390 395 400

AGA GAT GGT GGT AAC AAC AAC ATG AGC AAG ACC ACC GAG 1245
Arg Asp Gly Gly Asn Asn Asn Met Ser Lys Thr Thr Glu
405 410 415

ACC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1284
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
420 425

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA 1323
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
430 435 440

CCA TTA GGA GTA GCA CCC ACC AGG GCA AAG AGA AGA GTG 1362
Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val
445 450

GTG CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT GTG 1401
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
455 460 465



TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1435
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
470 475 478

(2) INFORMATION FOR SEQ ID NO: 26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 478 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
1 5 10 15

Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu Ala His Asn Val Trp 
20 25 30 



Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val
35 40 45

Glu Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
50 55 60

Met Val Glu Gln Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln
65 70 75

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr Ile Glu
95 100 105

Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile
110 115 120

Arg Asp Arg Phe Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp
125 130 135

Val Ile Pro Leu Gly Asn Asp Asn Thr Ser Tyr Arg Leu Ile Ser
140 145 150

Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
155 160 165

Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
170 175 180

Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Thr
185 190 195

Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val
200 205 210

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Ile
215 220 225

Val Ile Arg Ser Ala Asn Leu Thr Asp Asn Ala Lys Asn Ile Ile
230 235 240

Val Gln Leu Asn Glu Ser Val Thr Met Asn Cys Thr Arg Pro Asn
245 250 255

Asn Asn Thr Met Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe
260 265 270

Tyr Ala Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
275 280 285

Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys Ile Ala
290 295 300

Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile Val Phe Asn Gln
305 310 315

Ser Ser Gly Gly Asp Pro Glu Ile Ala Thr Leu Ser Phe Asn Cys
320 325 330

Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
335 340 345

Thr Trp Asn Ser Thr Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr
350 355 360

Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln
365 370 375

Lys Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile
380 385 390

Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly
395 400 405

Gly Asn Asn Asn Met Ser Lys Thr Thr Glu Thr Phe Arg Pro Gly
410 415 420

Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
425 430 435

Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala
440 445 450

Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
455 460 465

Ala Val Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
470 475 478

(2) INFORMATION FOR SEQ ID NO:27:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1435 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAT TCA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu
15 20 25






GCA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA 114 

Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr 
30 35 

GAC CCC AAC CCA CAA GAA GTA GAA TTG GAA AAT GTG ACA 153 

Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GGG GAT ATA ATT AGT TTA TCG GAT CAA AGC CTA 231
Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACG TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT GAC CCA AAT GTT ACT AAT AGC GAG AGA ACG 309
Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr
95 100



20 



ATA GAG GGG GGA GAA ATA AAA AAT TGC TCT TTC AAT ATC 34 8 
He Glu Gly Gly Glu He Lys Asn Cys Ser Phe Asn He 
105 HO us 

ACC ACA AAC ATA AGA GAT AGC TTT CAG AAA GAA TAT GCA 387 
Thr Thr Asn He Arg Asp Arg Phe Gin Lye Glu Tyr Ala 
120 125 

2 5 CTT TTT TAT AAA CTT GAT GTA ATA CCA TTA GOT AAT GAT 426 

Leu Phe Tyr Lys Leu Asp Val He Pro Leu Gly Asn Asp 
130 135 140 

AAT ACT AGC TAT AGG TTG ATA AGT TGT AAC ACC TCA GTC 465 
30 Asn Thr Ser Tyr Arg Leu He Ser Cys Asn Thr Ser Val 
145 150 155 

ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAC CCA ATT 504 
He Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro He 
35 160 165 

CCC ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT CTA 543 

Pro lie His Tyr Cys Ala Pro Ala Gly Phe Ala He Leu 
170 175 180 

40 

AAG TGT AAA GAT AAG AAG TTC AAT GGA ACA GGA CCA TGT 582 

Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys 
185 190 

4 5 ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AAG 621 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Lys 
195 200 205 

CCA GTA GTA TCA ACT CAA CTG TTG TTA AAT GGC AGT CTA 660 
50 Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu 
210 215 220 

GCA GAA GAA GAC ATA GTA ATT AGA TCC GCC AAT CTC ACA 699 
Ala Glu Glu Asp He Val He Arg Ser Ala Asn Leu Thr 
55 225 230 

GAC AAT GCT AAA AAC ATA ATA GTA CAG CTG AAT GAA TCT 738 
Asp Asn Ala Lys Asn He He Val Gin Leu Asn Glu Ser 
235 240 245 



60 



GTA ACA ATG AAT TGT ACA AGA CCC AAC AAC AAT ACA ATG 777 
Val Thr Met Asn Cys Thr Arg Pro Asn Asn Asn Thr Met 
250 255 






AAA AGT ATA CAT ATA GGA CCA GGC AGA GCA TTT TAT GCA 816
Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
260 265 270

ACA GGA AAC ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT 855
Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
275 280 285

AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG 894
Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys
290 295

ATA GCT ATA AAA TTA AGA GAA CAA TTT AAT AAG ACA ATA 933
Ile Ala Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile
300 305 310

GTC TTT AAT CAA TCC TCA GGA GGG GAC CCA GAA ATT GCA 972
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ala
315 320

ACG CTC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT 1011
Thr Leu Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
325 330 335

AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AGT ACT 1050
Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr
340 345 350

GGG TCA AAT AAC ACT AAA GGA AAT GAC ACA ATC ACA CTC 1089
Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr Ile Thr Leu
355 360

CCA TGC AGA ATA AGA CAA ATT ATA AAC ATG TGG CAG AAA 1128
Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Lys
365 370 375

ATA GGA AAA GCA ATG TAT GCC CCT CCC ATC AAA GGG CAA 1167
Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln
380 385

ATT AGA TGT TCA TCA AAT ATT ACA GGG CTA ATA TTA ACA 1206
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr
390 395 400

AGA GAT GGT GGT AAC AAC AAC ATG AGC AAG ACC ACC GAG 1245
Arg Asp Gly Gly Asn Asn Asn Met Ser Lys Thr Thr Glu
405 410 415

ACC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1284
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
420 425

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA 1323
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
430 435 440

CCA TTA GGA GTA GCA CCC ACC AGG GCA AAG AGA AGA GTG 1362
Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val
445 450

GTG CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT GTG 1401
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
455 460 465






TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1435 
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa 
470 475 478 
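
The paired nucleotide and amino acid lines of SEQ ID NO: 27 can be cross-checked mechanically. The following is a minimal sketch, not part of the original listing, assuming the standard genetic code and assuming that stop codons are written as Xaa, as they are in this sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
# Standard genetic code; codons are ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# One-letter to three-letter codes; "*" (stop) is written as Xaa here,
# which is an assumption that mirrors how SEQ ID NO: 27 prints stop codons.
THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Xaa",
}

CODON_TABLE = {
    a + b + c: AMINO_1[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string (spaces allowed) into three-letter residues.

    Trailing bases that do not fill a complete codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return [THREE[CODON_TABLE[c]] for c in codons]


first_row = "CTC GAG GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT"
last_row = "TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A"
print(" ".join(translate(first_row)))
print(" ".join(translate(last_row)))
```

Running the same function over each row of the listing and comparing the result with the printed amino acid line is one way to spot transcription errors in either line.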

(2) INFORMATION FOR SEQ ID NO: 28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 478 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
1 5 10 15

Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu Ala His Asn Val Trp
20 25 30

Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val
35 40 45

Glu Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
50 55 60

Met Val Glu Gln Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln
65 70 75

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr Ile Glu
95 100 105

Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile
110 115 120

Arg Asp Arg Phe Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp
125 130 135

Val Ile Pro Leu Gly Asn Asp Asn Thr Ser Tyr Arg Leu Ile Ser
140 145 150

Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
155 160 165

Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
170 175 180

Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Thr
185 190 195

Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val
200 205 210

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Ile
215 220 225

Val Ile Arg Ser Ala Asn Leu Thr Asp Asn Ala Lys Asn Ile Ile
230 235 240

Val Gln Leu Asn Glu Ser Val Thr Met Asn Cys Thr Arg Pro Asn
245 250 255

Asn Asn Thr Met Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe
260 265 270

Tyr Ala Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
275 280 285

Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys Ile Ala
290 295 300

Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile Val Phe Asn Gln
305 310 315

Ser Ser Gly Gly Asp Pro Glu Ile Ala Thr Leu Ser Phe Asn Cys
320 325 330

Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
335 340 345

Thr Trp Asn Ser Thr Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr
350 355 360

Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln
365 370 375

Lys Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile
380 385 390

Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly
395 400 405

Gly Asn Asn Asn Met Ser Lys Thr Thr Glu Thr Phe Arg Pro Gly
410 415 420

Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
425 430 435

Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala
440 445 450

Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
455 460 465

Ala Val Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
470 475 478

(2) INFORMATION FOR SEQ ID NO: 29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 511 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Met Arg Val Lys Gly Ile Arg Arg Asn Tyr Gln His Trp Trp Gly Arg
1 5 10 15

Gly Thr Met Leu Leu Gly Leu Leu Met Ile Cys Ser Ala Thr Glu Lys
20 25 30

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
35 40 45

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Ala
50 55 60

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65 70 75 80

Gln Glu Val Glu Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp Lys
85 90 95

Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asn
100 105 110

Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
115 120 125

Asn Cys Thr Asp Leu Arg Asn Thr Thr Asn Thr Asn Asn Ser Thr Asp
130 135 140

Asn Asn Asn Ser Lys Ser Glu Gly Thr Ile Lys Gly Gly Glu Met Lys
145 150 155 160

Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Gly Asp Lys Met Gln Lys
165 170 175

Glu Tyr Ala Leu Leu Tyr Lys Leu Asp Ile Glu Pro Ile Asp Asn Asp
180 185 190

Ser Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
195 200 205

Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
210 215 220

Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe Ser Gly
225 230 235 240

Lys Gly Ser Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
245 250 255

Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
260 265 270

Glu Glu Val Val Ile Arg Ser Glu Asp Phe Thr Asp Asn Ala Lys Thr
275 280 285

Ile Ile Val His Leu Lys Glu Ser Val Gln Ile Asn Cys Thr Arg Pro
290 295 300

Asn Tyr Asn Lys Arg Lys Arg Ile His Ile Gly Pro Gly Arg Ala Phe
305 310 315 320

Tyr Thr Thr Lys Asn Ile Lys Gly Thr Ile Arg Gln Ala His Cys Ile
325 330 335

Ile Ser Arg Ala Lys Trp Asn Asp Thr Leu Arg Gln Ile Val Ser Lys
340 345 350

Leu Lys Glu Gln Phe Lys Asn Lys Thr Ile Val Phe Asn Pro Ser Ser
355 360 365

Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
370 375 380

Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe Asn Ser Ile Trp Asn Gly
385 390 395 400

Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser Asn Asn Asn Ile Thr Leu
405 410 415

Gln Cys Lys Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Lys
420 425 430

Ala Met Tyr Ala Pro Pro Ile Glu Gly Gln Ile Arg Cys Ser Ser Asn
435 440 445

Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Asp Thr Asp Thr
450 455 460

Asn Asp Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn
465 470 475 480

Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Thr Ile Glu Pro Leu
485 490 495

Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu
500 505 510



(2) INFORMATION FOR SEQ ID NO: 30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2800 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

TTCGAGCTCG CCCGACATTG ATTATTGACT AGAGTCGATC GACAGCTGTG 50

GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA 100

GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG 150

TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 200

GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 250

CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 300

TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG 350

AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 400

CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTCAGGTAA 450

GTACCGCCTA TAGAGTCTAT AGGCCCACCC CCTTGGCTTC GTTAGAACGC 500

GGCTACAATT AATACATAAC CTTTTGGATC GATCCTACTG ACACTGACAT 550

CCACTTTTTC TTTTTCTCCA CAGGTGTCCA CTCCCAGGTC CAACTGCACC 600

TCGGTTCGCG AAGCTAGCTT GGGCTGCATC GATTGAATTC CACTGCCTTC 650

CACCAAGCTC TGCAGGATCC CAGAGTCAGG GG TCT GTA TCT TCC TGC 697
Ser Val Ser Ser Cys
1 5

TGG TGG CTC CAG TTC AGG AAC AGT AAA CCC TGC TCC GAA TAT 739
Trp Trp Leu Gln Phe Arg Asn Ser Lys Pro Cys Ser Glu Tyr
10 15

TGC CTC TCA CAT CTC GTC AAT CTC CGC GAG GAC TGG GGA CCC 781
Cys Leu Ser His Leu Val Asn Leu Arg Glu Asp Trp Gly Pro
20 25 30

TGT GAC AAG CTT CAG CGC GAA CGA CCA ACT ACC CCG ATC ATC 823
Cys Asp Lys Leu Gln Arg Glu Arg Pro Thr Thr Pro Ile Ile
35 40 45

AGT TAT CCT TAA GGT CTC TTT TGT GTG GTG CGT TCC GGT ATG 865
Ser Tyr Pro * Gly Leu Phe Cys Val Val Arg Ser Gly Met
50 55 60

GGG GGG ACT GCC GCC AGG TTG GGG GCC GTG ATT TTG TTT GTC 907
Gly Gly Thr Ala Ala Arg Leu Gly Ala Val Ile Leu Phe Val
62 65 70 75

GTC ATA GTG GGC CTC CAT GGG GTC CGC GGC AAA TAT GCC TTG 949
Val Ile Val Gly Leu His Gly Val Arg Gly Lys Tyr Ala Leu
80 85

GCG GAT GCC TCT CTC AAG ATG GCC GAC CCC AAT CGA TTT CGC 991
Ala Asp Ala Ser Leu Lys Met Ala Asp Pro Asn Arg Phe Arg
90 95 100

GGC AAA GAC CTT CCG GTC CTG GAC CAG CTG CTC GAG GTA CCT 1033
Gly Lys Asp Leu Pro Val Leu Asp Gln Leu Leu Glu Val Pro
105 110 115

GTG TGG AAA GAA GCA AAC ACC ACT CTA TTT TGT GCA TCA GAT 1075
Val Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp
120 125 130

GCT AAA GCA TAT AAG ACA GAG GCA CAT AAT GTT TGG GCC ACA 1117
Ala Lys Ala Tyr Lys Thr Glu Ala His Asn Val Trp Ala Thr
135 140 145

CAT GCC TGT GTA CCC ACA GAC CCC AAA CCA CAA GAA ATA AAA 1159
His Ala Cys Val Pro Thr Asp Pro Lys Pro Gln Glu Ile Lys
150 155

TTG GAA AAT GTG ACA GAA AAT TTT AAC ATG TGG AAA AAT AAC 1201
Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
160 165 170

ATG GTA GAA CAG ATG CAT GAG GAT ATA ATC AGT TTA TGG GAT 1243
Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
175 180 185

CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT 1285
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val
190 195 200

ACT TTA AAT TGC ACT GAT TTG AGG AAT AAT ACT AAT ACC AAT 1327
Thr Leu Asn Cys Thr Asp Leu Arg Asn Asn Thr Asn Thr Asn
205 210 215

AGT ACC TAC GGA AAA ATA ATG GAG GGA GGA GAG ATA AAA AAC 1369
Ser Thr Tyr Gly Lys Ile Met Glu Gly Gly Glu Ile Lys Asn
220 225

TGC TCT TTC AAT ATC ACC ACA AGC ATA AAA GAT AAG CTG AAA 1411
Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asp Lys Leu Lys
230 235 240

GAT ATG TCA CTT TTT TAT AAA CTT GAT GTA GTA CCA ATA GGT 1453
Asp Met Ser Leu Phe Tyr Lys Leu Asp Val Val Pro Ile Gly
245 250 255

AAT AAT AGT AAT ACT ACT AGT TAT AGG TTG ATA AGT TGT AAC 1495
Asn Asn Ser Asn Thr Thr Ser Tyr Arg Leu Ile Ser Cys Asn
260 265 270

ACC TCA GTC ATT ACA CAA GCC TGT CCA AAG ACA TCC TTT GAG 1537
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Thr Ser Phe Glu
275 280 285

CCA ATT CCC ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT 1579
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
290 295

CTC AAG TGT AAT GAT AAT AAG TTC AAT GGA ACA GGA CCA TGT 1621
Leu Lys Cys Asn Asp Asn Lys Phe Asn Gly Thr Gly Pro Cys
300 305 310

CCA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA 1663
Pro Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro
315 320 325

GTA GTA TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA 1705
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
330 335 340

AAA GAG GTA GTC CTT AGA TCT GAA AAT TTC ACG GAC AAT GCT 1747
Lys Glu Val Val Leu Arg Ser Glu Asn Phe Thr Asp Asn Ala
345 350 355

AAA ACC ATA ATA GTA CAG CTG AAC GAA TCT GTA ATA ATT GAT 1789
Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Ile Ile Asp
360 365

TGT ATG AGA CCC AAC AAC AAT ACA AGA ACA AGT ATA CCT ATG 1831
Cys Met Arg Pro Asn Asn Asn Thr Arg Thr Ser Ile Pro Met
370 375 380

GGA CCA GGG AAA GCA TTT TAT GCA ACA GGA GAT GTA ATA GGA 1873
Gly Pro Gly Lys Ala Phe Tyr Ala Thr Gly Asp Val Ile Gly
385 390 395

GAT ATA AGA CGA GCA CAT TGT AAC ATT AGT AGA GCA GGA TGG 1915
Asp Ile Arg Arg Ala His Cys Asn Ile Ser Arg Ala Gly Trp
400 405 410

AAT ACC ACT TTA CAA CAG ATA GCT AAA AAA TTA AGA GAA AAA 1957
Asn Thr Thr Leu Gln Gln Ile Ala Lys Lys Leu Arg Glu Lys
415 420 425

TTT GAG AAC AAA ACA ATA GTT TTT AAT CAC TCC TCA GGA GGG 1999
Phe Glu Asn Lys Thr Ile Val Phe Asn His Ser Ser Gly Gly
430 435

GAC CCA GAA ATT GTA ATG CAC ACT TTT AAT TGT GGA GGG GAA 2041
Asp Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu
440 445 450

TTT TTC TGC TGT AAT TCA ACA CCA CTG TTT AAT AGT ACT TGG 2083
Phe Phe Cys Cys Asn Ser Thr Pro Leu Phe Asn Ser Thr Trp
455 460 465

AAT GAT GCA CAA CTG TTT AAT AGT ACT TGG GAT GAT ACT AAA 2125
Asn Asp Ala Gln Leu Phe Asn Ser Thr Trp Asp Asp Thr Lys
470 475 480

TGG TCA AAA GGC ACT AAC GAA AAT GAC ACA ATC ACC CTC CAT 2167
Trp Ser Lys Gly Thr Asn Glu Asn Asp Thr Ile Thr Leu His
485 490 495

TGC AGA ATA AAA CAA ATT ATA AAT ATG TGG CAG GAA GTA GGA 2209
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly
500 505

AAA GCA ATG TAT GCC CCT CCC ATC AAA GGA CAA ATT AGA TGT 2251
Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys
510 515 520

GAA TCA AAT ATT ACA GGG CTG CTA TTA ACA AGA GAT GGT GGT 2293
Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
525 530 535

AAC GAC ACG AGC AAG AAT AAC ACT GAG ATT TTC AGA CCT GGA 2335
Asn Asp Thr Ser Lys Asn Asn Thr Glu Ile Phe Arg Pro Gly
540 545 550

GGA GGA AAT ATG AAG GAC AAT TGG AGA AGT GAA TTA TAT AAA 2377
Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys
555 560 565

TAT AAA GTA ATA AAA ATT GAA CCA TTA GGA GTA GCA CCC ATC 2419
Tyr Lys Val Ile Lys Ile Glu Pro Leu Gly Val Ala Pro Ile
570 575 579



TAGGCAAAGA GAAGAGTGGT GCAGAGAGAA AAAAGAGCAG TGACACTAGG 2469

AGCTATGTTC CTTGGGTTCT TGGGAGCAGC AGGAAGCACT ATGGGCGATA 2519

AGCTTTAATG CGGTAGTTTA TCACAGTTAA ATTCGTAACG CACTCAGGCA 2569

CCGTGTATGA AATCTAACAA TGCGACCTGC AGAAGCTTAG AACCGAGGAA 2619

CTTGTTTATT GCAGCTTATA ATCGTTACAA ATAAACCAAT AGCATCACAA 2669

ATTTCACAAA TAAAGCATTT TTTTCACTGC ATTCTAGTTG TGGTTTGTCC 2719

AAACTCATCA ATGTATCTTA TCATGTCTGG ATCGGGAATT AATTCGGCGC 2769

AGCACCATGG CCTGAAATAA CCTCTGAAAG A 2800

(2) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 579 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Ser Val Ser Ser Cys Trp Trp Leu Gln Phe Arg Asn Ser Lys Pro Cys
1 5 10 15

Ser Glu Tyr Cys Leu Ser His Leu Val Asn Leu Arg Glu Asp Trp Gly
20 25 30

Pro Cys Asp Lys Leu Gln Arg Glu Arg Pro Thr Thr Pro Ile Ile Ser
35 40 45

Tyr Pro * Gly Leu Phe Cys Val Val Arg Ser Gly Met Gly Gly Thr
50 55 60

Ala Ala Arg Leu Gly Ala Val Ile Leu Phe Val Val Ile Val Gly Leu
65 70 75 80

His Gly Val Arg Gly Lys Tyr Ala Leu Ala Asp Ala Ser Leu Lys Met
85 90 95

Ala Asp Pro Asn Arg Phe Arg Gly Lys Asp Leu Pro Val Leu Asp Gln
100 105 110

Leu Leu Glu Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys
115 120 125

Ala Ser Asp Ala Lys Ala Tyr Lys Thr Glu Ala His Asn Val Trp Ala
130 135 140

Thr His Ala Cys Val Pro Thr Asp Pro Lys Pro Gln Glu Ile Lys Leu
145 150 155 160

Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu
165 170 175

Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
180 185 190

Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu
195 200 205

Arg Asn Asn Thr Asn Thr Asn Ser Thr Tyr Gly Lys Ile Met Glu Gly
210 215 220

Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asp
225 230 235 240

Lys Leu Lys Asp Met Ser Leu Phe Tyr Lys Leu Asp Val Val Pro Ile
245 250 255

Gly Asn Asn Ser Asn Thr Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr
260 265 270

Ser Val Ile Thr Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro
275 280 285

Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp
290 295 300

Asn Lys Phe Asn Gly Thr Gly Pro Cys Pro Asn Val Ser Thr Val Gln
305 310 315 320

Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn
325 330 335

Gly Ser Leu Ala Glu Lys Glu Val Val Leu Arg Ser Glu Asn Phe Thr
340 345 350

Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Ile Ile
355 360 365

Asp Cys Met Arg Pro Asn Asn Asn Thr Arg Thr Ser Ile Pro Met Gly
370 375 380

Pro Gly Lys Ala Phe Tyr Ala Thr Gly Asp Val Ile Gly Asp Ile Arg
385 390 395 400

Arg Ala His Cys Asn Ile Ser Arg Ala Gly Trp Asn Thr Thr Leu Gln
405 410 415

Gln Ile Ala Lys Lys Leu Arg Glu Lys Phe Glu Asn Lys Thr Ile Val
420 425 430

Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr Phe
435 440 445

Asn Cys Gly Gly Glu Phe Phe Cys Cys Asn Ser Thr Pro Leu Phe Asn
450 455 460

Ser Thr Trp Asn Asp Ala Gln Leu Phe Asn Ser Thr Trp Asp Asp Thr
465 470 475 480

Lys Trp Ser Lys Gly Thr Asn Glu Asn Asp Thr Ile Thr Leu His Cys
485 490 495

Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
500 505 510

Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys Glu Ser Asn Ile Thr
515 520 525

Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asp Thr Ser Lys Asn Asn
530 535 540

Thr Glu Ile Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg
545 550 555 560

Ser Glu Leu Tyr Lys Tyr Lys Val Ile Lys Ile Glu Pro Leu Gly Val
565 570 575

Ala Pro Ile

(2) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1533 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

ATGGGGGGGA CTGCCGCCAG GTTCGGGGCC GTGATTTTGT TTGTCGTCAT 50

AGTGGGCCTC CATGGGGTCC GCGGCAAATA TGCCTTGGCG GATGCCTCTC 100

TCAAGATGGC CGACCCCAAT CGATTTCGCG GCAAAGACCT TCCGGTCCTG 150

GACCAGCTGC TCGAG GTA CCT GTG TGG AAA GAA GCA ACC ACC 192
Val Pro Val Trp Lys Glu Ala Thr Thr
1 5

ACT CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAT ACA GAG 234
Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu
10 15 20

GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC 276
Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
25 30 35

CCC AAC CCA CAA GAA ATA GGA TTG GAA AAT GTA ACA GAA AAT 318
Pro Asn Pro Gln Glu Ile Gly Leu Glu Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAT GAG 360
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu
55 60 65

GAT ATA ATC AGT TTA TGG GAT CAA AGC TTA AAG CCA TGT GTA 402
Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val
70 75

AAA TTA ACC CCA CTA TGT GTT ACT TTA AAT TGC ACT GAT TTG 444
Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu
80 85 90

AAA AAT GCT ACT AAT ACC ACT AGT AGC AGC TGG GGA AAG ATG 486
Lys Asn Ala Thr Asn Thr Thr Ser Ser Ser Trp Gly Lys Met
95 100 105

GAG AGA GGA GAA ATA AAA AAC TGC TCT TTC AAT GTC ACC ACA 528
Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Val Thr Thr
110 115 120

AGT ATA AGA GAT AAG ATG AAG AAT GAA TAT GCA CTT TTT TAT 570
Ser Ile Arg Asp Lys Met Lys Asn Glu Tyr Ala Leu Phe Tyr
125 130 135

AAA CTT GAT GTA GTA CCA ATA GAT AAT GAT AAT ACT AGC TAT 612
Lys Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
140 145

AGG TTG ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCC TGT 654
Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
150 155 160

CCA AAG GTG TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT GCC 696
Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
165 170 175

CCG GCT GGT TTT GCG ATT CTA AAG TGT AGA GAT AAA AAG TTC 738
Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Lys Lys Phe
180 185 190

AAC GGA ACA GGA CCA TGT ACA AAT GTC AGC ACA GTA CAA TGT 780
Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys
195 200 205

ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA CTG CTG TTA 822
Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
210 215

AAT GGC AGT TTA GCA GAA GAA GAA GTA GTA ATT AGA TCT GCC 864
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Ala
220 225 230

AAT TTC TCG GAC AAT GCT AAA ACC ATA ATA GTA CAG CTG AAC 906
Asn Phe Ser Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
235 240 245

GAA TCT GTA GAA ATT AAT TGT ACA AGA CCC AAC AAC AAT ACA 948
Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
250 255 260

AGA AGA AGT ATA CAT ATA GGA CCA GGG AGA GCA TTT TAT GCA 990
Arg Arg Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
265 270 275

ACA GGA GAA ATA ATA GGA GAC ATA AGA CAA GCA CAT TGT AAC 1032
Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
280 285

CTT AGT AGC ACA AAA TGG AAT AAT ACT TTA AAA CAG ATA GTT 1074
Leu Ser Ser Thr Lys Trp Asn Asn Thr Leu Lys Gln Ile Val
290 295 300

ACA AAA TTA AGA GAA CAT TTT AAT AAA ACA ATA GTC TTT AAT 1116
Thr Lys Leu Arg Glu His Phe Asn Lys Thr Ile Val Phe Asn
305 310 315

CAC TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT 1158
His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
320 325 330

AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA CCA CTG 1200
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Pro Leu
335 340 345

TTT AAT AGT ACT TGG AAT TAT ACT TAT ACT TGG AAT AAT ACT 1242
Phe Asn Ser Thr Trp Asn Tyr Thr Tyr Thr Trp Asn Asn Thr
350 355

GAA GGG TCA AAT GAC ACT GGA AGA AAT ATC ACA CTC CAA TGC 1284
Glu Gly Ser Asn Asp Thr Gly Arg Asn Ile Thr Leu Gln Cys
360 365 370

AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA 1326
Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
375 380 385

GCA ATG TAT GCC CCT CCC ATA AGA GGA CAA ATT AGA TGC TCA 1368
Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser
390 395 400

TCA AAT ATT ACA GGG CTG CTA TTA ACA AGA GAT GGT GGT AAT 1410
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn
405 410 415

AAC AGC GAA ACC GAG ATC TTC AGA CCT GGA GGA GGA GAT ATG 1452
Asn Ser Glu Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met
420 425

AGG GAC AAT TGG AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA 1494
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
430 435 440

AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC AAG GCA TAA 1533
Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala *
445 450 455



(2) INFORMATION FOR SEQ ID NO: 33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 456 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp
1 5 10 15

Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala
20 25 30

Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Gly Leu Glu Asn Val
35 40 45

Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
50 55 60

Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys
65 70 75 80

Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Lys Asn Ala
85 90 95

Thr Asn Thr Thr Ser Ser Ser Trp Gly Lys Met Glu Arg Gly Glu Ile
100 105 110

Lys Asn Cys Ser Phe Asn Val Thr Thr Ser Ile Arg Asp Lys Met Lys
115 120 125

Asn Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Ile Asp Asn
130 135 140

Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
145 150 155 160

Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
165 170 175

Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Lys Lys Phe Asn
180 185 190

Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly
195 200 205

Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
210 215 220

Glu Glu Glu Val Val Ile Arg Ser Ala Asn Phe Ser Asp Asn Ala Lys
225 230 235 240

Thr Ile Ile Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg
245 250 255

Pro Asn Asn Asn Thr Arg Arg Ser Ile His Ile Gly Pro Gly Arg Ala
260 265 270

Phe Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys
275 280 285

Asn Leu Ser Ser Thr Lys Trp Asn Asn Thr Leu Lys Gln Ile Val Thr
290 295 300

Lys Leu Arg Glu His Phe Asn Lys Thr Ile Val Phe Asn His Ser Ser
305 310 315 320

Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
325 330 335

Phe Phe Tyr Cys Asn Thr Thr Pro Leu Phe Asn Ser Thr Trp Asn Tyr
340 345 350

Thr Tyr Thr Trp Asn Asn Thr Glu Gly Ser Asn Asp Thr Gly Arg Asn
355 360 365

Ile Thr Leu Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu
370 375 380

Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys
385 390 395 400

Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn
405 410 415

Ser Glu Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn
420 425 430

Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
435 440 445

Gly Val Ala Pro Thr Lys Ala *
450 455



(2) INFORMATION FOR SEQ ID NO: 34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

GGGAATTCGG ATCCAGAGCA GAAGACAGTG GCAATGA 37



(2) INFORMATION FOR SEQ ID NO: 35:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

CTCGAGCTCC TGAAGACAGT CAGACTCATC AAG 33



(2) INFORMATION FOR SEQ ID NO: 36:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

GGTCTAGAAG CTTTAGCCCA TAGTGCTTCC TGCTGCTCC 39

(2) INFORMATION FOR SEQ ID NO: 37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

GGGCGGATCC TCGAGGTACC TGTRTGGAAA GAAGCA 36

(2) INFORMATION FOR SEQ ID NO: 38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

GGTCTAGAAG CTTTATGCTC CYAAGAACCC AAGGAACA 38
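
SEQ ID NOs: 37 and 38 each contain one degenerate base (R and Y, respectively). The sketch below, which is not part of the original listing, assumes these letters follow the standard IUPAC nucleotide ambiguity codes (R = A or G, Y = C or T) and enumerates the concrete oligonucleotides each entry represents. The names `IUPAC` and `expand` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes (assumed to apply to the listing).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand(primer):
    """Return every concrete sequence encoded by a degenerate primer."""
    seq = "".join(primer.split()).upper()
    choices = [IUPAC[base] for base in seq]
    return ["".join(combo) for combo in product(*choices)]


SEQ_37 = "GGGCGGATCC TCGAGGTACC TGTRTGGAAA GAAGCA"
SEQ_38 = "GGTCTAGAAG CTTTATGCTC CYAAGAACCC AAGGAACA"

for name, primer in (("SEQ ID NO: 37", SEQ_37), ("SEQ ID NO: 38", SEQ_38)):
    variants = expand(primer)
    print(name, "encodes", len(variants), "sequences:")
    for v in variants:
        print("  ", v)
```

Because each primer has a single two-fold degenerate position, each entry expands to two sequences of the stated length (36 and 38 bases).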



(2) INFORMATION FOR SEQ ID NO: 39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

Ile Gly Pro Gly Arg Ala Phe
1 5

(2) INFORMATION FOR SEQ ID NO: 40:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Ile Gly Pro Gly Arg Ala Trp
1 5

(2) INFORMATION FOR SEQ ID NO: 41:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

Leu Gly Pro Gly Ser Thr Phe
1 5

(2) INFORMATION FOR SEQ ID NO: 42:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Ile Gly Pro Gly Arg Val Leu
1 5

(2) INFORMATION FOR SEQ ID NO: 43:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

Ile Gly Pro Gly Ser Ala Phe
1 5

(2) INFORMATION FOR SEQ ID NO: 44:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Ile Gly Pro Gly Arg
1 5



WHAT IS CLAIMED IS:

1. An isolated polypeptide comprising an HIV gp120
amino acid sequence selected from the group
consisting of Sequence ID Nos. 2, 4, 6, 8, 10, 12,
14, 16, 18, 20, 22, 24, 26, and 28, and fragments
thereof.

2. The polypeptide of Claim 1 wherein the polypeptide
additionally comprises a flag epitope sequence.

3. The polypeptide of Claim 2 wherein the flag
epitope sequence is HSV gD-1 flag epitope
sequence.

4. The polypeptide of Claim 2 wherein the flag
epitope sequence is fused to the HIV gp120 amino
acid sequence.

5. An oligonucleotide of not more than five kilobases
encoding an HIV gp120 polypeptide sequence
comprising an amino acid sequence selected from
the group consisting of Sequence ID Nos. 2, 4, 6,
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 28, and
fragments thereof.

6. The oligonucleotide of Claim 5 wherein the
oligonucleotide includes a nucleotide sequence
selected from the group consisting of Sequence ID
Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25,
and 27, and fragments thereof.

7. The oligonucleotide of Claim 5 wherein the amino
acid sequence encoded by the oligonucleotide
additionally comprises a flag epitope.

8. The oligonucleotide of Claim 5 wherein the flag
epitope is HSV gD-1 flag epitope.

9. The oligonucleotide of Claim 7 wherein the flag
epitope is fused to the HIV gp120 amino acid
sequence.

10. A vaccine comprising gp120 MN and an HIV gp120
polypeptide sequence selected from the group
consisting of Sequence ID Nos. 2, 4, 6, 8, 10, 12,
14, 16, 18, 20, 22, 24, 26, and 28, and fragments
thereof in a suitable carrier.

11. A vaccine comprising:

a. a first gp120 polypeptide sequence or a
fragment thereof; and
b. a breakthrough isolate HIV gp120 polypeptide
sequence or a fragment thereof from a
vaccinee vaccinated with said first HIV gp120
polypeptide sequence;

wherein said HIV gp120 polypeptide sequences are
in a suitable carrier.

12. The vaccine of Claim 11 wherein said first HIV
gp120 polypeptide sequence comprises gp120 MN,
gp120 A244, gp120 MN-GNE6 (Sequence ID No. 31), or
gp120 MN-GNE8 (Sequence ID No. 33).

13. The vaccine of Claim 12 wherein said vaccine
additionally comprises a second gp120 polypeptide
sequence comprising gp120 MN, gp120 A244, gp120
MN-GNE6 (Sequence ID No. 31), or gp120 MN-GNE8
(Sequence ID No. 33), or a fragment thereof,
wherein said second HIV gp120 polypeptide sequence
is different from said first HIV gp120 polypeptide
sequence.

14. The vaccine of Claim 13 wherein said first gp120
polypeptide sequence comprises gp120 MN and said
second gp120 polypeptide sequence comprises gp120
A244.

15. The vaccine of Claim 14 wherein said breakthrough
isolate comprises an HIV gp120 polypeptide
sequence selected from the group consisting of
Sequence ID Nos. 2, 4, 6, 8, 10, 12, 14, 16, 18,
20, 22, 24, 26, and 28, and fragments thereof in a
suitable carrier.

16. The vaccine of Claim 13 wherein said first gp120
polypeptide sequence comprises gp120 MN and said
second gp120 polypeptide sequence comprises gp120
MN-GNE8 (Sequence ID No. 33).

17. The vaccine of Claim 16 wherein said breakthrough
isolate HIV gp120 polypeptide sequence is an HIV
gp120 polypeptide sequence selected from the group
consisting of Sequence ID Nos. 2, 4, 6, 8, 10, 12,
14, 16, 18, 20, 22, 24, 26, and 28, and fragments
thereof, in a suitable carrier.

18. The vaccine of Claim 13 wherein said breakthrough
isolate HIV gp120 polypeptide is from a vaccinee
vaccinated with said first and second HIV gp120
polypeptide sequences.

19. A method for making an HIV vaccine comprising
adding an HIV gp120 polypeptide sequence or
fragments thereof from a breakthrough isolate from
a vaccinee to the vaccine with which the vaccinee
was vaccinated.

20. The vaccine of Claim 11 wherein said first gp120
polypeptide sequence is from a macrophage-tropic
HIV-1 strain.

21. The vaccine of Claim 11 wherein said first gp120
polypeptide sequence is from a T-cell-tropic HIV-1
strain.

22. The vaccine of Claim 21 wherein said vaccine
additionally comprises a second gp120 polypeptide
sequence or a fragment, from a macrophage-tropic
HIV-1 strain.

23. The vaccine of Claim 22 wherein said first and
second gp120 polypeptide sequences bind to
different chemokine receptors.

24. The vaccine of Claim 23 wherein said first gp120
polypeptide sequence binds to CC-CKR-5 and said
second gp120 polypeptide sequence binds to
CXC-CKR-4.

25. The vaccine of Claim 11 wherein said vaccine
additionally comprises a virus engineered to
induce a cytotoxic T-cell response.



WO 98/01564 



PCT/US97/09690 



1/22 



FIG. 1A and FIG. 1B (x-axis: Time (weeks))



AMENDED SHEET (ARTICLE 19) 



2/22

FIG. 1C (x-axis: Time (weeks))



3/22

FIG. 1E and FIG. 1F (x-axis: Time (weeks))



4/22

FIG. 1G (x-axis: Time (weeks))



5/22

FIG. 2A and FIG. 2B (x-axis: Weeks)



6/22

FIG. 2C and FIG. 2D (x-axis: Weeks)



7/22

FIG. 2E and FIG. 2F (x-axis: Weeks)






20/22

FIG. 5B (x-axis: MAb (ng/ml))



21/22

FIG. 5D (x-axis: MAb (ng/ml))






INTERNATIONAL SEARCH REPORT

International Application No
PCT/US 97/09690

A. CLASSIFICATION OF SUBJECT MATTER
IPC 6 C12N15/49 C07K14/16 A61K39/21

According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)
IPC 6 C12N C07K A61K

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practical, search terms used)

C. DOCUMENTS CONSIDERED TO BE RELEVANT

Citation of document, with indication, where appropriate, of the relevant passages, followed by the claims to which it is relevant:

WO 94 28929 A (GENENTECH, INC.) 22 December 1994
see page 56 SEQ. ID. NO. 25.
see page 50, line 14 - line 31
Relevant to claim No.: 1-18, 20-25

P.W. BERMAN ET AL.: "Genetic and immunologic characterization of viruses infecting MN-rgp120 vaccinated volunteers"
ONE WORLD, ONE HOPE: XI INTERNATIONAL CONFERENCE ON AIDS,
vol. 10, no. supplement 3, 7 - 12 July 1996, VANCOUVER, CANADA,
page 10 XP002045307
See "Methods" in Abstract Mo.A.285
Relevant to claim No.: 1,5,6

Further documents are listed in the continuation of box C.

Patent family members are listed in annex.

* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance
"E" earlier document but published on or after the international filing date
"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)
"O" document referring to an oral disclosure, use, exhibition or other means
"P" document published prior to the international filing date but later than the priority date claimed
"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention
"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone
"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art
"&" document member of the same patent family

Date of the actual completion of the international search
30 October 1997

Date of mailing of the international search report
26-11-1997

Name and mailing address of the ISA
European Patent Office, P.B. 5818 Patentlaan 2
NL-2280 HV Rijswijk
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016

Authorized officer
Cupido, M

Form PCT/ISA/210 (second sheet) (July 1992)

page 1 of 2



INTERNATIONAL SEARCH REPORT

International Application No
PCT/US 97/09690

C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

Category: A
M.J. MCELRATH ET AL.: "Human immunodeficiency virus type 1 infection despite prior immunization with a recombinant envelope vaccine regimen"
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA,
vol. 93, no. 9, 30 April 1996, WASHINGTON US,
pages 3972-3977, XP002045308
see page 3976, last paragraph; figure 1
Relevant to claim No.: 19

Category: T
P.W. BERMAN ET AL.: "Genetic and immunologic characterization of viruses infecting MN-rgp120-vaccinated volunteers"
THE JOURNAL OF INFECTIOUS DISEASES,
vol. 176, no. 2, August 1997,
pages 384-397, XP002045309
see the whole document
Relevant to claim No.: 1-25

Form PCT/ISA/210 (continuation of second sheet) (July 1992)

page 2 of 2



INTERNATIONAL SEARCH REPORT

Information on patent family members

International Application No
PCT/US 97/09690

Patent document cited in search report: WO 9428929 A
Publication date: 22-12-94

Patent family member(s) and publication date:
AU 7047894 A   03-01-95
EP 0708659 A   01-05-96

Form PCT/ISA/210 (patent family annex) (July 1992)



